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Preface 


Synthetic protein switches with custom response functions have become invaluable tools in 
basic research and biotechnology for monitoring biomolecular analytes or actuating cellular 
functions in a rapid, specific, integrated, and autonomous fashion. This book provides a 
comprehensive summary of state-of-the-art protocols to facilitate the construction of syn- 
thetic protein switches for a variety of applications in biotechnology and basic research. 
Protocols are applicable to life scientists from diverse research fields that range from tradi- 
tional, discovery-centered disciplines such as cancer research to newly emerging disciplines 
such as synthetic biology. 

Chapters are grouped into separate sections focusing on different types of switches, 
sensors, and actuators. Starting with a general view, I first discuss the experimental chal- 
lenges and theoretical considerations that underlie the construction of synthetic protein 
switches, also highlighting an increasing number of computational approaches which aim 
to render the design cycle more rational and therefore more efficient. In the second chap- 
ter, Ha and Loh provide an overview on the construction of synthetic protein switches by 
means of alternative frame folding and intermolecular fragment exchange which promises a 
generic route to convert any conventional binding receptor or enzyme into an allosterically 
regulated protein switch. This is followed up by a detailed protocol by Ribeiro, Ostermeier, 
et al. on the construction of synthetic protein switches by means of domain insertion 
describing the underlying non-homology-dependent DNA recombination process to build 
DNA libraries. 

Subsequent chapters become increasingly specific, providing case studies on how to 
engineer synthetic protein switches for different types of applications. Starting with protocol 
chapters that describe the construction of fluorescent and bioluminescent sensors, Mitchell, 
Jackson, et al. and Clifton, Jackson, et al. demonstrate how computational strategies based 
on molecular modeling and statistical sequence analysis can be applied to engineer small 
molecule FRET sensors with enhanced biophysical properties. Farrants, Johnsson, et al. then 
describe a general route toward small molecule sensors based on semisynthetic fluorescent 
and bioluminescent sensors that are built with the SNAP-tag protein conjugation system. 
Finally, Nyati et al. and Matysuma, Ueda, et al. illustrate the construction of bioluminescent 
sensors based on proximity-dependent and allosterically regulated firefly luciferases. 

Beyond fluorescent and bioluminescent sensors, three chapters by Iwai et al., Wouters 
et al., and Nirantar et al. focus on the construction of synthetic protein switches based on 
B-lactamase, which has served as a model enzyme for pioneering a number of design strate- 
gies, for instance, by means of domain insertion and competitive autoinhibition. This is 
followed up by two chapters that describe the construction of protease-based switches as 
Wintgens, Wehr, et al. and Stein and Alexandrov illustrate how viral proteases can be reen- 
gineered into synthetic protease sensors with custom input-output functions based on split- 
and competitively autoinhibited architectures. 

The book concludes with chapters focusing on the construction of protein switches 
that can actuate biological signaling functions in live cells. To this end, Muehlhaecuser, 
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Radzwilli, et al.; Stabel, Moeglich, et al.; Cosentino, Moroni, et al.; and Taxis provide pro- 
tocols on how to regulate protein kinase function, ion channel permeability, and protein 
degradation by means of light-regulated protein switches. This is followed up with protocol 
chapters by Castillo, Ghosh, et al. and DiRoberto, Peisajovich, et al. who devise strategies 
for regulating cellular signal transduction systems through biologically inert ligands and 
rewiring key nodes of intracellular signaling systems. 


Darmstadt, Germany Viktor Stein 
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General Strategies and Considerations 


Chapter 1 


Synthetic Protein Switches: Theoretical 
and Experimental Considerations 


Viktor Stein 


Abstract 


Synthetic protein switches with tailored response functions are finding increasing applications as tools in 
basic research and biotechnology. With a number of successful design strategies emerging, the construction 
of synthetic protein switches still frequently necessitates an integrated approach that combines detailed bio- 
chemical and biophysical characterization in combination with high-throughput screening to construct tai- 
lored synthetic protein switches. This is increasingly complemented by computational strategies that aim to 
reduce the need for costly empirical optimization and thus facilitate the protein design process. Successful 
computational design approaches range from analyzing phylogenetic data to infer useful structural, bio- 
physical, and biochemical information to modeling the structure and function of proteins ab initio. The 
following chapter provides an overview over the theoretical considerations and experimental approaches that 
have been successful applied in the construction of synthetic protein switches. 


Key words Protein switches, Protein engineering, Synthetic biology, Protein signaling, Genetic 
circuits 


1. Introduction 


Synthetic protein switches with tailored response functions are 
finding increasing applications as tools in basic research helping 
dissect the molecular mechanisms that underlie the function of a 
cell, or in biotechnology as diagnostic reagents reporting in an 
autonomous fashion on distinct molecular biomarkers that are spe- 
cific for a disease process [1, 2]. Common to all synthetic protein 
switches is a receptor that recognizes a distinct molecular queue 
(such as ligand binding or a posttranslational modification) and an 
actuator that is functionally coupled to the receptor and thus able 
to translate the primary molecular recognition event into a change 
in biophysical, chemical, or enzymatic signal depending on the 
preferred readout. 

At the molecular level, a number of architectures have been 
successfully devised to construct synthetic protein switches with 
tailored response functions: These range from integrated designs 
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featuring allosteric-binding receptors that are inserted into the ter- 
tiary structure of an actuator such as a fluorescent protein (FP) or 
an autoinhibited enzyme module, to modularly organized binding 
receptors and actuators where independently folding functional 
domains are organized along a linear polypeptide chain. For inte- 
grated designs, a binding event is typically transduced from the 
receptor to the actuator through a complex network of conforma- 
tional transitions in the tertiary structure of a protein. In contrast, 
modularly organized synthetic protein switches are typically regu- 
lated through mutually exclusive binding interactions where con- 
formational transitions are limited to the linkers connecting 
independently folding functional domains. Beyond single-compo- 
nent protein switches, synthetic protein switches can also be com- 
posed of multiple molecularly distinct components. These are 
typically regulated through the induced proximity of a transducer 
with an actuator or two split protein halves [3, 4]. 

In terms of specific applications, synthetic protein switches are 
increasingly employed as intracellular sensors that monitor molecu- 
lar functions in an integrated and autonomous fashion in real time, 
e.g., reporting on the presence or absence of key metabolites, pro- 
tein-protein interactions, or posttranslational modifications based 
on fluorescence or bioluminescence readouts [5-8]. In compari- 
son, conventional techniques that have traditionally been employed 
to analyze protein-associated functions by means of antibodies or 
mass spectrometry only provide snapshots of molecular states as 
cells and tissues need to be broken up and/or fixed for analysis. In 
this case, monitoring time courses of biological processes based on 
successive time points quickly becomes laborious and also intro- 
duces variability from repeated sampling. Beyond applications in 
basic research, synthetic protein switches are increasingly devel- 
oped as diagnostic reagents to detect clinically important biomark- 
ers in an integrated fashion with no need for laborious work-up 
steps such as the successive binding and washing steps necessitated 
by immunological techniques based on antibodies. 

Beyond applications as molecular sensors, synthetic protein 
switches can also be employed to actuate biological functions [9- 
12]. Traditionally, this has been realized through small molecular 
weight ligands that can bind and thus control the function of key 
signaling proteins inside the cell. In the majority of cases, small 
molecular weight ligands primarily inhibit protein-associated func- 
tions. In contrast, synthetic protein switches can regulate cellular 
functions in both positive and negative ways, for instance, by 
introducing artificial control elements into key regulatory proteins 
of intracellular signal transduction pathways. 

With a number of applications emerging in basic research 
and biotechnology, a key bottleneck has been to devise gener- 
ally applicable strategies to engineer synthetic protein switches 
with tailored response functions [1, 2, 13, 14]. Notably, current 
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Fig. 1 Summary of the key experimental steps in the design-build-test cycle of 
synthetic protein switches. The design of synthetic protein switches is based on 
structural intuition that is increasingly complemented by computer-assisted 
design processes based on the molecular modeling of protein structures and sta- 
tistical sequence analysis that aim to render the design stage more rational and 
automated (see Subheading 3). Once designed, synthetic protein switches can be 
built using a variety of DNA assembly procedures that include DNA homology and 
non-homology-dependent recombination methods as well as ligation-dependent 
strategies (see Subheading 4). Individual designs are then tested empirically for 
their correct function. Depending on the likelihood that designs are correct, tai- 
lored screening systems of varying throughput are required (see Subheading 5) 


design strategies extensively rely on iterative cycles of designing, 
building, and testing synthetic protein switches (Fig. 1) with the 
emphasis on empirical testing that is costly and time-consuming. 
The following chapter thus provides a summary of the key experi- 
mental techniques and theoretical considerations that apply to the 
construction of synthetic protein switches. 


2 Designing Synthetic Protein Switches 


A key goal in synthetic biology is to engineer biological functions 
a priori [15, 16]. This is to accelerate the design-build-test cycle 
and reduce the need for costly empirical optimization. In addition, 
a capacity to engineer biological functions a priori reflects on our 
fundamental understanding of the underlying biological processes 
and phenomena. In the context of proteins, significant progress 
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2.1 Design 
by Molecular Intuition 


has been made in the computational design of protein structures, 
protein assemblies, protein-protein interactions, ligand and sub- 
strate specificities, as well as catalytic mechanisms [17-21 ]-yet, 
progress in the computational design of synthetic protein switches 
with tailored response functions has been limited. Notably, syn- 
thetic protein switches are dynamic entities and undergo confor- 
mational transitions that are critically important for their function, 
yet challenging to analyze and even more challenging to predict, 
control, and engineer in a systematic fashion. The majority of syn- 
thetic protein switches have thus been designed based on an intui- 
tive molecular understanding of protein structure and function 
while computational strategies increasingly assist in the rational 
optimization of key functional or biophysical properties. 


The protein database (PDB) features over 120,000 solved protein 
structures that can be exploited for the structure-guided engineer- 
ing of protein switches by (semi-)rationally recombining binding 
receptors with enzymes, fluorescent, or bioluminescent proteins. 
Protein structures are readily accessible through structural visual- 
ization programs such as PyMol (DeLano WL, 2002 The PYMOL 
Molecular Graphics System) that provide an indispensable design 
aid. For instance, in domain insertion strategies, an allosteric 
receptor is typically inserted into surface exposed loop regions 
such that ligand-induced conformational changes are efficiently 
transmitted to the actuator modulating its function. In this way, 
synthetic protein switches and sensors have been engineered based 
on GFP [22-24], B-lactamase [25-27], tyrosine protein kinases 
[28-30], xylanase [31], and PQQ-dependent glucose dehydroge- 
nase (GDH) [32]. Similarly, alternative frame folding relies on a 
thorough structural analysis to identify, duplicate, and modify 
structural elements that are important for the binding or catalytic 
function of a synthetic protein switch [33-35 |. Structurally related 
to synthetic protein switches engineered by domain insertion are 
split protein complementation sensors that reassemble into a func- 
tional protein upon induced localization of the two protein halves. 
Here, structural intuition frequently guides the choice of the split 
sites that separate a protein into two structurally well-defined 
subdomains. In this way, it has been possible to reengineer a num- 
ber of split luciferases to report on intracellular signaling events 
[36-39], split tobacco etch virus (TEV) proteases to sense and 
actuate intracellular signaling function [40-43], split tyrosine pro- 
tein kinases to actuate cellular signaling functions and screen for 
drugs [44, 45], and split PQQ-dependent GDH as a universal 
biosensor platform [46]. In addition, the construction of modu- 
larly organized protein switches based on structurally distinct allo- 
steric receptors and actuators benefits from high-resolution 
structural information as it provides clues about the position and 
relative orientation of the N- and C-termini that assist in the 
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construction of the connecting linkers and facilitate rapid diversi- 
fication of input functions. Notably, many intracellular signal 
transducers are organized in a modular fashion that facilitates 
rewiring the response functions of bacterial [47—49] and eukary- 
otic [50-55] signal transducers or the construction of genetically 
encoded [56-61] and semisynthetic protein sensors [62-67]. 
Visual inspections of protein structures are, however, relatively 
crude design strategies that are nonquantitative, rely on manual 
assessment, and frequently need to be optimized empirically 
through experimental screening. Ideally, the function of a syn- 
thetic protein switch can be engineered computationally in an 
automated fashion based on quantitative parameters, which also 
reflects on our fundamental understanding how protein sequence 
relates to protein structure and function. 


Toward this goal, a number of computational strategies have been 
pursued to analyze and engineer structural and functional proper- 
ties of a protein a priori by means of computational design 
[68-70]. In its most elementary form, molecular dynamic simula- 
tions compute the behavior of an ensemble of molecules based on 
the physical forces that every single atom is subject to. Such high- 
resolution models are however computationally expensive, and in 
practice take prolonged periods of time to model the structure or 
the conformational dynamics of proteins. As a result, molecular 
dynamics simulations are primarily restricted to analytical studies 
and thus not suited to iterate through large numbers of protein 
mutants as necessitated in rational protein design. 

Instead, increasing grades of abstraction and simplification are 
introduced aiming to limit the conformational search space and 
accelerate computation times [68-70]. This usually requires iden- 
tifying, approximating, and weighing the key parameters that 
underlie a structural, biophysical, or functional property. Specific 
simplifications include restricting the dihedral angles of the poly- 
peptide backbone and amino acid sidechains to the most frequently 
occurring rotamers (in the same way structural biologists match 
the tertiary structure of a protein to its electron density map) or 
approximating secondary structure propensities, solvation terms, 
electrostatic energies, and hydrogen bond potentials. This is 
increasingly complemented by bioinformatic approaches mining 
protein structures for functional motifs that can be grafted onto a 
desired binding or enzyme catalyzed reaction. 

In this way, a number of new protein structures and functions 
could be computationally engineered including new folds [71, 72], 
new ligand and substrate specificities |73, 74], as well as new cata- 
lytic functions [75, 76]. In contrast, predictably engineering the 
conformational transitions that underlie the switch-like behavior of 
synthetic protein switches has proven more difficult and primarily 
relied on redesigning individual properties. In one recent example, 
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2.3 Design 
by Statistical 
Sequence Analysis 


an allosterically regulated Ca**-sensitive Kemp Eliminase was engi- 
neered by introducing a binding site and reactive groups for a 
Kemp Eliminase reaction into the EF hand of calmodulin, while 
preserving its natural propensity to undergo a conformational tran- 
sition from compact to extended upon binding Ca’* [77 ]. Similarly, 
the ligand specificity of the bacterial transcription factor LacI was 
computationally reengineered to recognize fucose, gentiobiose, 
lactitiol, and sucralose [78], while preserving the natural propen- 
sity of LacI to bind DNA in a ligand-dependent fashion. However, 
preserving natural allosteric transitions while introducing new 
ligand specificities is nontrivial, and in case of bacterial transcrip- 
tion factors additionally involved experimental screening and selec- 
tion of a large library of mutant protein switches [78]. 

In contrast, predictably engineering the conformational transi- 
tions that underlie synthetic protein switches have so far met with 
limited success. This particularly applies to integrated designs, 
where allosteric changes are regulated through complex networks 
of amino acids in the tertiary structure of a protein that are difficult 
to recapitulate in a rational manner. In contrast, for modularly 
organized protein switches with structurally distinct receptor, actu- 
ator, and Al-domains, the behavior of the connecting linkers can 
be described with synthetic polymer models to assist balancing 
steric strain in ligand-bound and unbound conformational states. 
In one example, the worm-like chain (WLC) model was success- 
fully applied to quantify the behavior of Gly-Ser-rich linkers con- 
necting two FPs undergoing resonance energy transfer in a 
Zn?*-specific protein sensor [79]. Yet, these models have so far 
primarily been used to rationalize the behavior of a linker post- 
experimentally, but not engineer linkers a priori. 


Beyond structure-guided protein engineering, the evolutionary 
history of proteins provides a rich source of information that can 
be computationally analyzed to derive useful functional and bio- 
physical properties of proteins. Notably, next-generation sequenc- 
ing technologies have generated an unprecedented wealth of 
sequence data that provides a detailed snapshot on the evolution of 
proteins and protein families. This data is increasingly mined and 
analyzed using sophisticated computational algorithms to extract 
valuable information on how the primary structure of a protein 
correlates with key biophysical and functional properties. 

In the simplest case, the consensus sequence of a protein can 
highlight functionally and structurally important residues that are 
conserved within a protein family [80]. Enriching proteins with 
conserved consensus motifs has previously been shown to improve 
their thermal and conformational stability that constitutes a critical 
parameter in the development of recombinant proteins for many 
biotechnological applications including therapeutic binding agents 
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[81-83] or enzymes for large-scale, industrial biosynthesis [84, 85 ]. 
It is worth noting that the consensus sequence of a protein does not 
yield a true protein sequence, but an averaged one which neglects 
that individual mutations are subject to epistatic effects [86]. This 
means, depending on their context, combination of mutations can 
have synergistic, neutral, or detrimental effects on a specific struc- 
tural, biophysical, or function property. Considering this correla- 
tion is lost in the consensus sequence, the resulting proteins are not 
necessarily functional and, thus, frequently have to be correlated 
with additional sequence, biochemical, biophysical, or structural 
information to yield proteins with the desired properties. 

In contrast to the consensus sequence approach, ancestral gene 
resurrection (AGR) aims to identify the true sequence of a primor- 
dial protein [87, 88]. This approach is unique in that it allows to 
resurrect and experimentally study extinct proteins. Notably, from 
a protein engineer’s perspective, ancestrally resurrected proteins 
display a number of superior properties over their contemporary 
counterparts. This includes superior folding, improved thermody- 
namic stability [89-91], and greater levels of substrate promiscuity 
[89, 92], which, in the context of engineering synthetic proteins, 
has already been exploited to reengineer the ligand specificity of 
allosteric binding receptors [92 ]. Similar to the consensus sequence 
approach, the evolutionary tree of a protein family is retraced based 
on multiple sequence alignments and different statistical methods. 
These include maximum likelihood, maximum parsimony, or 
Bayesian reconstruction to calculate the posterior probability of a 
protein sequence at every evolutionary branch point. While the 
specific evolutionary ancestral resurrection algorithm is frequently 
of debate—especially, if the true ancestral sequence of a protein is 
to be determined in the context of evolutionary studies-this is a 
lesser concern in protein engineering as long as the resurrected 
protein sequences yield improved functional or biophysical 
properties. For instance, AGR has been employed to improve the 
thermodynamic and folding efficiency of L-arginine-specific peri- 
plasmic-binding proteins (PBPs). This turned out critical for their 
efficient recombination with FPs to engineer L-arginine-specific 
FRET sensors [90] and also facilitated their subsequent reengi- 
neering into L-glutamine-specific FRET sensors [92]. 

Finally, statistical coupling analysis (SCA) has been successfully 
applied to identify co-evolving networks of residues that are distant 
in primary, but continuous in tertiary structure highlighting 3D 
hotspots that are functionally coupled in a protein [93, 94]. 
Notably, recombining AsLov2 and PDZ receptor domains with 
dihydrofolate reductase (DHFR) via computationally predicted 
allosteric hotspots yielded a regulated enzyme that transduces 
light- and ligand-induced conformational transitions from the 
receptor to the actuator [95]. 
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3 Building Synthetic Protein Switches 


3.1. Non-Homology- 
Dependent 
Recombination 
Strategies 


3.2 Homology- 
Dependent 
Recombination 
Strategies 


Considering computational approaches can only optimize a lim- 
ited number of biophysical and functional properties in a protein; 
this means the construction of synthetic protein switches relies, to 
a significant extent, on empirical optimization based on medium- 
to high-throughput screening assays. As a general rule of thumb, 
synthetic protein switches generated by means of random domain 
insertion rely on higher throughput screening approaches due to 
the less predictable effect of recombining two structurally well- 
defined protein domains on fold, structure, and function. In con- 
trast, modularly organized synthetic protein switches can be 
engineered in a more rational manner solely focusing on the length 
and structure of the linkers connecting individual domains. 
Consequently, each of the individual design strategies imposes dif- 
ferent challenges on the underlying DNA assembly process. 


Before the advent of highly affordable synthetic DNA, random 
domain insertions were created following a limited endonuclease 
digest of a circular DNA construct coding for an actuator and sub- 
sequent fusion with a linear DNA construct coding for an allosteric 
receptor. The latter may also be circularly permutated resulting in 
a set of new N- and C-termini which potentially enhances the 
transmission of conformational changes between the receptor and 
the actuator; these are not necessarily confined to the original N- 
and C-termini, but most pronounced at internal sites [96]. The 
resulting libraries are then empirically screened for domain inser- 
tion mutants that are functionally recombined in allosteric hotspots 
(c.f. SCA that aims to predict allosteric hotspots as opposed to 
experimentally screen for them). This strategy has, for instance, 
been successfully applied to engineer a number of allosterically 
regulated enzymes, including maltose regulated B-lactamase [25, 
97, 98], xylose regulated xylanase [99], and HIF1-binding domain 
cytosine deaminase [100]. Considering only 1 in 6 constructs are 
in frame and the unpredictable effect of domain insertion of pro- 
tein structure and function, a large number of domain insertion 
mutants need to be screened using a suitable high-throughput 
screening assay. These can either be directly screened for functional 
protein switches, e.g., based on antibiotic resistance conferring 
B-lactamases [97 ] or in case of more technically challenging enzyme 
assays fused with GFP to identify in frame, non-homologously 
recombined genes before assaying for the relevant enzyme func- 
tion in a multiwell plate assay format [99]. 


Alternatively, more focused DNA insertion libraries can be created 
by means of homology-dependent DNA cloning methods over- 
coming the limitations associated with out-of-frame insertions: 
e.g., overlap extension PCR (OE-PCR) constitutes one of the 


3.3 Ligation- 
Dependent Assembly 
Strategies 
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earliest homology-dependent recombination methods [101, 102] 
and has recently been applied to engineer defined linker libraries 
for light-activated histidine protein kinase switches [103]. Here, a 
small number of DNA templates with overlapping homologous 
sequences prime each other during every reannealing step to 
recombine two DNA fragments. Recombination by means of 
OE-PCR can, however, prove technically challenging considering 
the relatively low efficiency of recombination between two larger 
single-stranded DNA fragments. This is further aggravated by the 
exponential nature of PCR amplification, which potentially renders 
OE-PCR susceptible to nonspecific DNA amplification products 
and limits the number of DNA fragments that can be simultane- 
ously recombined. 

More recently, Gibson assembly has originated as a powerful, 
homology-dependent cloning strategy relying on the combined 
action of a dsDNA 5’ to 3’ exonuclease, a thermostable DNA ligase 
and a thermostable DNA polymerase [104]. Reactions are typically 
conducted at 50 °C and initiated by the exonuclease-dependent 
chew back of the 5’ end. This results in the formation of single- 
stranded 3’ DNA extensions that guide the reannealing of homolo- 
gous DNA sequences that are subsequently extended and filled by 
the DNA polymerase and eventually sealed by the DNA ligase. 
Unlike OE-PCR, Gibson Assembly occurs at a constant tempera- 
ture without the need for thermal cycling coordinating successive 
reannealing and amplification steps. This significantly increases the 
efficiency of recombination, enables the simultaneous assembly of 
multiple DNA fragments, and prevents any bias that may arise 
through successive reannealing and amplification cycles. While 
technically easy, the efficiency of Gibson assembly can be reduced 
by secondary structures, repeat regions and GC-rich regions as they 
frequently occur in the glycine- and serine-rich polypeptide linkers 
as is applicable in the construction of synthetic protein switches. 

Beyond Gibson assembly, a number of alternative methods 
have been devised that rely on similar principles such as sequence 
and ligase-independent cloning (SliC) [105], circular polymerase 
extension cloning (CPEC) [106], seamless ligation cloning extract 
(SLICE) [107], or AQUA [108] where an exonuclease, DNA 
polymerase and DNA ligase function are included either as part of 
cell extract or within a cell. 


While OE-PCR and Gibson assembly enable the seamless assembly 
of DNA sequences independent of restriction sites, both methods 
rely on homologous DNA sequences of 20-50 bp. This generally 
restricts the reuse of DNA coding for common receptor, actuator, 
and linker elements from existing, sequence verified DNA con- 
structs and libraries. In addition, the longer the overhangs, the 
more expensive the synthesis of tailored oligonucleotides becomes. 
Alternatively, cloning strategies have been devised based on type 
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IIS restriction enzymes. These cut outside their recognition motif 
in a sequence-independent fashion to create tailored single- 
stranded DNA extensions. Crucially, unlike conventional restric- 
tion enzymes, the resulting single-stranded extensions are 
non-palindromic and thus facilitate the assembly of multiple DNA 
fragments in a directional manner. 

Golden Gate cloning constitutes one of the most widely used 
DNA assembly methods based on type IIS restriction enzymes 
allowing for the directional and seamless assembly of multiple 
DNA fragments [109]. In the context of engineering synthetic 
protein switches, distinct structural motifs, linker elements, and 
functional domains are first amplified by PCR using synthetic oli- 
gonucleotides that introduce tailored DNA overhangs. These 
overhangs code for a type IIS restriction site and a short recombi- 
nation motif that guide the ligation of multiple DNA fragments 
with complementary extension motifs. Individual DNA fragments 
are then fused following the combined action of a type HS restric- 
tion enzyme and a DNA ligase. One key disadvantage of type IIS 
restriction enzyme-dependent cloning strategies is the need to 
remove any potential restriction sites in the coding sequence. 
While this does not pose a concern for synthetic DNA fragments, 
where restriction sites can be specifically omitted, this is not the 
case with genomic sequences and DNA constructs that are already 
available in the plasmid database of a lab. 

Alternatively, USER Enzyme can be employed to create short 
single-stranded 3’ DNA extensions [110-114]. Here, single- 
stranded 3’ DNA extensions are created through the excision of 
uracil residues that are introduced via synthetic oligonucleotides at 
the PCR amplification step. The resulting 3’ DNA extensions sub- 
sequently guide the DNA ligase-dependent fusion of two or more 
DNA fragments. Scar sites are minimal as the only sequence 
requirement is a pair of A and T residues spaced apart by approxi- 
mately two to six nucleotides. Similar to type HS restriction sites, 
the single-stranded extensions of USER enzyme can be non- 
palindromic to enable the directional assembly of multiple DNA 
fragments. 

Ultimately, the preferred DNA assembly procedure will be 
determined by a number of factors: This includes the architecture 
of a specific protein switch (e.g., whether it is modularly organized 
or integrated), the source of DNA (e.g., whether it is of genomic 
or synthetic origin), as well as any idiosyncrasies associated with the 
construction of a particular protein switch (e.g., whether linker 
regions feature repeat regions, secondary structures, or high GC 
content). In addition, the potential for automation and the use of 
commercial DNA synthesis and cloning services plays an increas- 
ingly important consideration in devising cost-effective and effi- 
cient DNA assembly processes and needs to be assessed individually 
for different types of synthetic protein switches. 
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4 Testing Synthetic Protein Switches 


4.1 Engineering 
Subcomponents Using 
Display Technologies 


Historically, biotechnological innovation has extensively relied 
on experimental trial-and-error to adopt and reengineer existing 
biological functions toward specific applications. This particularly 
applies to the rational engineering of protein-associated functions 
which has been hampered by an insufficient understanding how 
the sequence of a protein relates to its function. Consequently, an 
increasing number of studies are breaking down the construction 
of synthetic protein switches into manageable substeps. This 
includes limited empirical optimization to engineer or optimize 
key functional properties such as the binding specificity of recep- 
tors and Al-domains, as well as their subsequent assembly into 
functional protein switches with tailored response functions. The 
latter is generally supported by medium- and high-throughput 
screening assays based on miulti- and_= single-cell assay 
technologies. 


The construction of modularly organized receptors and actuators, 
where allosteric transitions are primarily mediated by flexible linker 
regions, has raised the possibility of constructing synthetic protein 
switches from individual subcomponents based on structurally 
well-defined binding domains that either recognize the target 
ligand or modulate the output of the actuator. For instance, GFP 
and its engineered derivatives have a propensity of dimerizing with 
uM affinity which has been shown to enhance the sensitivity and 
dynamic range of FRET-based fluorescent sensors [115, 116]. 
Similarly, a number of enzymes feature naturally occurring, geneti- 
cally encoded inhibitors that can be exploited for the construction 
of synthetic protein switches based on the autoinhibited B-lactamase 
module [56, 117, 118]. In the absence of structural information 
or the presence of naturally occurring receptor and AlI-domains, 
highly specific protein-based binders that either recognize the tar- 
get molecule or associate with the actuator to modulate its func- 
tion can either be constructed de novo or sourced from natural 
sources and optimized using a variety of display technologies such 
as phage [119], yeast [120, 121], and various in vitro display tech- 
nologies that either feature RNA [122-124] or DNA [125-129] 
as the coding nucleic acid. 

Collectively, these systems display a protein either on the sur- 
face of either phage or yeast or in vitro directly on its coding nucleic 
acid maintaining a physical association between genotype (i.e., its 
coding nucleic acid) and phenotype (1.e., the protein binder that 
mediates its binding function). Depending on the type of display 
system, the target ligand can be immobilized on a solid surface 
retaining and enriching those phage or nucleic acids that code for 
a functional binder. Alternatively, the target ligand can be labeled 
with a fluorescent reporter molecule labeling those cells or p!-beads 


14 Viktor Stein 


4.2 Assembling 
Synthetic Protein 
Switches 


that display a functional binder and enriching them by means of 
fluorescence activated cell sorting (FACS). 

Beyond choosing a suitable display system, the second major 
consideration concerns the scaffold protein to construct tailored 
protein binders. Historically, the development of next-generation 
biologics has yielded a diverse repertoire of recombinant binding 
scaffolds as alternatives to monoclonal antibodies [130-132]. 
These typically comprise independently folding, single-chain pro- 
tein domains and short peptide motifs with more or less defined 
structural propensities. Crucially, these newly developed binding 
scaffolds can be readily produced in Escherichia coli, fused to 
additional protein domains and generally display superior struc- 
tural, folding, and thermodynamic properties that facilitate their 
purification, biophysical characterization, and integration into 
modularly organized synthetic protein switches. 

In one recent example, an allosteric binding receptor was 
constructed by means of phage display fusing a circularly permu- 
tated PDZ domain with an engineered fibronectin (FN) scaffold 
that serves as an enhancer domain [57, 59]. The two domains 
are connected through a Gly-Ser rich linker, which is unstruc- 
tured in the ligand unbound state, but forms a structurally well- 
defined sandwich complex in the ligand-bound state. Biophysical 
studies have also shown that formation of the sandwich complex 
is associated with a distinct movement of the receptor domain. 
This was subsequently exploited to create fluorescence and pro- 
tease-based switches following recombination of the affinity 
clamp receptor with fluorescent proteins [58] and autoinhibited 
protease modules [60]. 


Arguably, the most technically challenging aspect in the construc- 
tion of synthetic protein switches is to recombine individual sub- 
components (e.g., the binding receptor, the actuator, and 
Al-domains) into fully functional protein switches with tailored 
response functions. Depending on the type of switch, this requires 
testing a varying number of designs over successive screening and 
selection cycles while looking to optimize their input-dependent 
switching behavior. Experimentally, this is the most labor-inten- 
sive step and, apart from designing a particular synthetic protein 
switch (see Subheading 2), the most creative one considering for 
every different actuator a tailored screening assay needs to be 
devised. 

As a rule of thumb, the higher the throughput, the more tech- 
nically challenging it becomes to establish a suitable screening 
assay. This particularly applies to synthetic protein switches that are 
ideally screened in positive and negative selection modes looking 
to identify those switches that display the largest differential func- 
tion in the presence and absence of a desired target analyte. 
Considering the majority of synthetic protein switches actuate 


4.3 Multi-cell 
Screening in 

Colony- and Multiwell- 
Format 
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their signal either through enzymes or FPs, the preferred readouts 
are based on spectroscopic assays monitoring changes in fluores- 
cence, luminescence, or absorbance. 

In addition, synthetic protein switches are usually composed of 
multiple protein domains. This constitutes a frequently underesti- 
mated factor that imposes constraints on the recombinant expres- 
sion ofa particular class of protein switches as well as their operating 
environment that both have to be accounted for in the design pro- 
cess. For instance, if a particular protein switch is designed to func- 
tion intracellularly, its performance can be limited by cell intrinsic 
factors: e.g., incomplete translation or proteolytic cleavage of flex- 
ible linker regions can limit the expression of a full-length synthetic 
protein switch and ultimately the maximum induction ratio. This 
constitutes less of a concern if a synthetic protein switch is devel- 
oped for in vitro applications where full-length proteins can be 
purified through N- and/or C-terminal purification tags. 


Spectroscopic assays in combination with multiwell plate readers 
constitute one of the most ubiquitous assay formats used to moni- 
tor and measure binding or catalytic functions of several thousands 
of mutants by means of comparatively inexpensive and widespread 
laboratory equipment. Notably, spectroscopic assays that monitor 
changes in fluorescence or absorbance in multiwell plate assays for- 
mats allow for the time resolved measurement of protein function 
and the possibility to duplicate samples within a single plate. The 
latter greatly facilitates quantitative comparisons between synthetic 
protein switches in the presence and absence of a desired target 
analyte (e.g., binding ligand, cofactor, or any other target analyte 
that modulates the activity of the protein switch). Colony screens 
are conceptually similar to multiwell plate assays considering 
microbial colonies on an agar plate comprise thousands of mutants 
that can be screened on average in a cost-efficient manner. The 
only added complication is that assay readouts need to be spatially 
confined to individual colonies, for instance, through a FP or pre- 
cipitating products of an enzyme-catalyzed reaction. 

To assess the function of synthetic protein switches in high 
throughput in either multiwell or colony-based screening formats, 
experimental screening procedures need to be as simple as possi- 
ble, ideally requiring only the sequential addition of reagents with 
no successive washing steps that can introduce comparatively large 
variabilities. Frequently, the target analyte or substrate cannot be 
coexpressed nor readily diffuses across the cell membrane, but 
needs to be added exogenously while a protein needs to be secreted 
or released into the lysate. The former imposes limitations on the 
functional folding of a protein, for instance, if a particular scaffold 
or enzyme naturally folds in the reducing environment of the cyto- 
plasm, it may not efficiently export and fold in the periplasm of 
Escherichia colt. 
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4.4 Single-Cell 
Screening Based 
on FACS 


In practice, these considerations already prove challenging in 
the construction of allosterically regulated FP sensors by means of 
colony-based screening format: e.g., considering Ca** ions do not 
readily diffuse across the Escherichia coli inner plasma membrane, 
this means Ca**-specific single-FP sensors need to be exported to 
the periplasm to bind Ca** [133]. In comparison, genetically 
FRET-based FP sensors that are composed of two FP domains can- 
not be exported to the E. coli periplasm so that the plasma mem- 
brane needs to be selectively permeabilized to allow diffusion of 
Ca** [134]. Beyond throughput, it has also been realized that, for 
optimal performance, synthetic protein switches are preferably 
screened in their operating environment: e.g., Ca**-responsive sin- 
gle-FP sensors for imaging applications in neurobiology are 
increasingly screened in human cell lines such as HEK293 [135]. 
Similarly, HIF 1-responsive cytosine deaminases originally screened 
and optimized in Escherichia coli have subsequently been shown to 
have off-target effects in human cells [136]. 


For higher throughput assays, fluorescent-activated cell sorting 
(FACS) can boost the screening capacity by several orders of mag- 
nitude, while the outcome of a screening and selection experiment 
can be holistically analyzed by means of next-generation sequenc- 
ing. The key difference is that FACS-based selection procedures 
assay the function associated with a single cell as opposed to an 
average output of tens of millions of cells in multiwell plates or a 
colony. This imposes a number of technical challenges on FACS- 
based screening procedures: Firstly, the activity of a synthetic pro- 
tein switch needs to be assayed either inside or directly on the 
surface of a cell. For actuators with catalytic functions, the substrate 
and/or the product thus need to be retained inside or attached on 
the surface of the cell. Secondly, asynchronies in cell growth and 
division as well as bursts in transcription and translation render the 
expression of recombinant proteins stochastic. As a result, expres- 
sion levels and therefore the experimental signal usually vary by an 
order of magnitude across individual cells. To some extent, varying 
expression levels can be normalized over the size of a cell, e.g., by 
normalizing over forward scatter, which is an indicator of cell size, 
but does not allow for the same precision as multiwell plate screen- 
ing assays. This poses challenges if the function of a synthetic pro- 
tein switch only fractionally improves during every design-build-test 
cycle. Furthermore, positive and negative selections need to be per- 
formed in a sequential fashion that provides less precise readouts 
compared to side-by-side comparison in multiwell plates. 
Unsurprisingly, the number of FACS-based screening proce- 
dures that have been successfully devised to construct synthetic 
protein switches is limited. In one recent example, trehalose- 
specific single-FP sensors were engineered in Escherichia coli 
by inserting allosteric, trehalose-specific PBPs into GFP [137]. 


4.5 Growth-Based 
Genetic Selection 
Assays 
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In another example, small molecule-dependent sensors were engi- 
neered based on ligand receptors that are proteolytically degraded, 
but stabilized upon binding their cognate ligand [138]. The result- 
ing ligand sensors can, in turn, be fused either to a fluorescent 
protein or a transcription factor to regulate the activity of a reporter 
gene. Yet, this strategy heavily relies on screening millions of 
mutants in Saccharomyces cerevisae using a suitable high-throughput 
screening procedure, while destabilizing mutations cannot be 
identified computationally. In another example, a protease-based 
screening strategy has been devised in Saccharomyces cerevisae 
based on controlling the export of a reporter protein to the cell 
surface by cleaving off a protein export tag [139]. This screen was 
specifically applied to engineer the substrate specificity of TEV 
protease using sequential positive and negative selection cycles and 
could be readily adapted to screen the function of synthetic protein 
switches in the presence and absence of a target analyte. 

Beyond FACS-based screening procedures, p-droplet-based 
screening strategies have been developed for greater control of 
protein expression and reaction conditions in microfluidic devices 
[140-142]. While technically challenging and not widely available 
in the majority of molecular and cell biology oriented labs, 
u-droplet screening strategies carry a number of advantages that 
are directly applicable to the construction of synthetic protein 
switches: Firstly, the function of a synthetic protein switch includ- 
ing catalytic functions can be assayed extracellularly, which facili- 
tates the control of the reaction conditions. To this end, individual 
proteins can be expressed in Escherichia coli [143, 144], 
Saccharomyces cerevisiae [145, 146], or mammalian cells [147] and 
displayed on the cell surface or released following cell lysis. Droplet 
fusion technology can, in turn, be employed to deliver a substrate 
or a target analyte that is used to analyze a distinct protein func- 
tion. p1-droplet screening strategies are, however, technically chal- 
lenging and limited to laboratories with the specialist expertise, 
especially considering only few studies have successfully screened 
protein function using integrated devices that can fuse droplets, 
deliver reagents, incubate for defined time periods and assay pro- 
tein function such as enzymatic activity [143-147]. 


Beyond spectroscopic readouts based on synthetic or genetically 
encoded fluorescent, luminescent, and absorbant reporter mole- 
cules, selecting for the function of enzyme-based actuators can 
also be directly linked to the survival of a microorganism by 
means of growth-based selection assays. Both Escherichia coli 
and Saccharomyces cerevisiae are suitable for this purpose providing 
a range of antibiotic and auxotrophic markers. The most widely 
used selectable marker in Escherichia coli is based on B-lactamase 
that can also be assayed spectroscopically and has pioneered the 
design of synthetic protein switches by means of domain insertion 
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[25-27, 97, 98, 148], but also mutually exclusive binding interac- 
tions [56, 117, 118]. Saccharomyces cerevisiae also provides a pow- 
erful microorganism for devising high-throughput selection 
procedures based on genetic complementation. In one recent 
example, a light-regulated K* channel was engineered by recom- 
bining a photo-responsive LOV2-Ja domain with the small viral K* 
channel K,, [149]. To this end, synthetic K* channels were screened 
for light-responsiveness in positive and negative selection modes 
following illumination with blue light or in the dark. The selection 
strategy was based on a mutant strain of Saccharomyces cerevisiae 
deficient in endogenous K* channels. Similar genetic complemen- 
tation strategies are conceivable to screen for metabolic functions 
in high throughput through auxotrophic complementation of 
metabolic enzymes in both Escherichia coh and Saccharomyces 
cerevisine. 

Ultimately however, the key technical challenge with growth- 
based selection assays is to control the reaction conditions, in par- 
ticular, the reaction environment, the concentration of individual 
components and the selective pressure. In addition, the growth of 
a particular synthetic protein switch may not exclusively depend on 
its function, but a cell can both adapt genetically and biochemically 
to enhance growth irrespective of a given synthetic protein switch 
mutant. 


Synthetic protein switches are increasingly developed and applied 
both in basic research and biotechnology to monitor biological 
processes in an integrated and autonomous fashion. For now, 
due to our limited understanding to predictively engineer protein- 
associated functions, the construction of tailor-engineered protein 
switches has relied, to a significant extent, on empirical optimiza- 
tion based on high-throughput screening procedures. Suitable 
high-throughput screening and selection procedures are however 
technically challenging to establish, need to be tailored toward spe- 
cific enzyme readouts, and are ideally amenable to positive and 
negative selection modes. This is further hampered by the vast size 
of protein sequence space which generally outstrips our capacity to 
screen and engineer protein-associated functions in high through- 
put. This particularly applies to engineering allostericity that con- 
stitutes one of the most complex and least understood protein 
functions. Computational strategies have therefore been limited to 
optimizing individual properties such as the thermodynamic stabil- 
ity or the binding specificity of an allosteric receptor, but will 
undoubtedly continue gaining importance as our molecular mech- 
anistic understanding of artificially engineered protein switches is 
anticipated to improve. 
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Chapter 2 


Construction of Allosteric Protein Switches by Alternate 
Frame Folding and Intermolecular Fragment Exchange 


Jeung-Hoi Ha and Stewart N. Loh 


Abstract 


Alternate frame folding (AFF) and protein/fragment exchange (FREX) are related technologies for 
engineering allosteric conformational changes into proteins that have no pre-existing allosteric properties. 
One of their chief purposes is to turn an ordinary protein into a biomolecular switch capable of transform- 
ing an input event into an optical or functional readout. Here, we present a guide for converting an arbitrary 
binding protein into a fluorescent biosensor with Forster resonance energy transfer output. Because the 
AFF and FREX mechanisms are founded on general principles of protein structure and stability rather than 
a property that is idiosyncratic to the target protein, the basic design steps—choice of permutation/cleavage 
sites, molecular biology, and construct optimization—remain the same for any target protein. We highlight 
effective strategies as well as common pitfalls based on our experience with multiple AFF and FREX 
constructs. 


Key words AFF, Biosensor, Fluorescence, FRET, FREX, Protein design, Protein engineering 


Abbreviations 

AFF Alternate frame folding 
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Fn3 Fibronectin 3 

FP Fluorescent protein 
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Introduction 


A biosensor is minimally composed of an input module, which 
interacts with the analyte, and an output module that reports on 
that interaction. Proteins excel at both roles. As input domains 
they are masters of molecular recognition, having the ability to 
bind targets tightly and specifically amidst a sea of similar-looking 
decoys. As output domains they possess a wide array of biological 
functions, among the most useful of which for biosensing are 
fluorescence and enzymatic activity. A major additional advantage 
of a protein-based biosensor is that it is genetically encodable for 
in vivo applications. 

The main challenge in designing a protein-based biosensor is 
solving the problem of how to couple input and output domains, 
both physically and functionally, so that binding the analyte pro- 
duces a detectable signal. Nature has given us some treasured, but 
rare clues in the form of proteins that undergo large-scale confor- 
mational changes in response to ligand binding (e.g., calmodulin, 
which has launched a family of fluorescent calcium sensors). The 
great majority of proteins, however, do not change their structure 
appreciably upon binding. 

Calbindin Do,, fibronectin 3 (Fn3), and ribose binding protein 
(RBP) are three such examples in which the structures of the pro- 
teins in their ligand-free and ligand-bound states are similar. To 
address this challenge, we developed two methodologies for engi- 
neering a large, binding-dependent conformational change into 
each protein, which was then detected by placement of either 
chemical fluorophores or fluorescent proteins (FPs) [1-4]. The 
methods are known as alternate frame folding (AFF) and protein/ 
fragment exchange (FREX). 

AFF and FREX both use partial sequence duplication to give a 
protein of interest (POT) a mutually exclusive choice between fold- 
ing to its normal native state ( N) or to an alternate form ( N’) that 
possesses native-like structure and function (Fig. 1). Step 1 (see 
Subheading 3) of the protocol for creating an AFF-modified POI 
(POI-AFF) involves choosing an appropriate N-terminal or 
C-terminal segment of the POI to duplicate. One or more amino 
acids are identified in the segment that, when mutated, abrogate 
binding of the POI to its target ligand. Step 2 (see Subheading 4) 
entails attaching the duplicate copy of the N-terminal or C-terminal 
segment to the C- or N-terminus of the POI, respectively, by 
means of a peptide linker. Fig. la illustrates that POI-AFF can 
either fold by using the normal order of amino acids to yield N, or 
by using a rearranged order of amino acids to generate a circularly 
permuted structure (N’). The relative thermodynamic stabilities of 
Nand N’ are tuned in Step 3 (see Subheading 5) of the protocol 
such that POI-AFF is predominantly in state N (or N’) in the 
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Duplicate segment with binding mutation 
Duplicate segment without binding mutation 
Circular permutant linker 
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Fig. 1 Schematic of AFF (a) and FREX (b) switching mechanisms. Primary amino acid sequences are indicated 
by horizontal bars with folded protein structures represented below the sequences. For the two sequences in 
parentheses an N-terminal segment (containing a critical binding residue) is duplicated. The other two 
sequences, and the structures that result from their folding, represent the analogous case in which a C-terminal 
segment is duplicated. Wavy lines indicate the copy of the duplicate segment that is orphaned in N and N’ 
conformations and is hence unfolded. The -fold of POI-FREX is shown with a packing mutation in the green 
arrow that is swapped out by the wild-type residue from the orange arrow in the N’-fold 


absence of the target ligand, and chiefly in state N’ (or N) in its 
presence. Finally, fluorophores are incorporated at locations 
that consistently report on the N S$ N’ conformational change, 
independent of the choice of POI. 

The FREX mechanism can be considered an intermolecular 
version of AFF, in which the duplicated segment is not covalently 
attached to the POI from which it was derived, but rather added in 
trans to the POI (Fig. 1b). The N’-state of a FREX-modified POI 
(POI-FREX) is the intermolecular complex of the POI and the 
fragment, which forms only in the presence of the target ligand 
and is detected by FRET between donor and acceptor fluorophores 
placed on the POI and fragment, respectively. The chief advantage 


30 Jeung-Hoi Ha and Stewart N. Loh 


of FREX and other two-component designs is that the ratiometric 
FRET change observed upon binding tends to be greater than that 
of single-component sensors, because FRET efficiency is typically 
reduced to near-zero values in the unbound state of the two- 
component sensors. The main limitation of FREX is that the POI 
and fragment should be present at close to equimolar concentra- 
tions to achieve maximum FRET response. The protocols for cre- 
ating POI-AFF and POI-FREX sensors are very similar. We outline 
the protocol for AFF below, and enumerate the modifications for 
FREX after each step. 


2 Gathering Ingredients: The POI 


1. An available X-ray or other high-resolution structures of the 
POI or homolog thereof greatly facilitate the design process. 


2. For AFF, the POI should ideally not contain any reduced Cys 
residues, as fluorophores are typically introduced by thiol- 
reactive chemistry. Cys in the POI may be tolerated vis-a-vis 
fluorescence labeling if they are buried and inaccessible to sol- 
vent. FREX can make use of fluorescent proteins for detection 
so reduced Cys residues are not inherently problematic. For 
both AFF and FREX, if oxidized Cys are present, the duplicate 
segment must be chosen (see Subheading 3) such that it is not 
crosslinked to the nonduplicated region of the POI by a disul- 
fide bond. 


3. Consider using the most stable variant of the POI available, 
e.g., one derived from a thermophilic organism. An axiom of 
protein folding is that it is far easier to destabilize a protein 
than to stabilize it. Accordingly, most of the modifications and 
tuning mutations employed herein either intentionally or 
unintentionally destabilize the POI. Starting with a stable tem- 
plate allows for greater design freedom. 


3 Step 1 of AFF Protocol: Choosing the Segment of the POI to Duplicate 


3.1 Identify 
a Binding Mutation 


The only absolute requirement for the duplicate segment is that it 
contain at least one residue that, when mutated, greatly reduces 
affinity of the POI for its target ligand. Binding knockout muta- 
tions are often known from prior functional or genetic studies, and 
they may also be deduced from an existing crystal structure of the 
POI or homolog thereof. We have found that choosing a binding 
mutation close to the beginning or end of the amino acid sequence 
is advantageous, because this allows the duplicate segment to be 
short in length. Since the duplicated amino acids extend from 
POI-AFF as N- or C-terminal tails (Fig. la), shorter segments may 


3.2 Identify 
a Circular Permutation 
Site 


3.3 Design 
the Peptide Linker 


Engineering Allosteric Protein Switches 31 


present less of a risk for aggregation or degradation, although the 
opposite may be true. In any case, a binding mutation near one of 
the termini gives one the freedom to experiment with both short 
and long duplicate segments. 


For a binding mutation near the C-terminus, the duplicated seg- 
ment ends at the C-terminus (Fig. la). It begins at a position 
N-terminal to the binding mutation and this position is chosen to 
be a surface loop or turn. The reason is that this loop becomes the 
permutation site of the N’-fold, i.e., the location at which the poly- 
peptide chain is broken and new N- and C-termini are generated. 
Interrupting an alpha helix, beta strand, or buried hydrophobic 
region is expected to be more destabilizing than disrupting a sur- 
face loop, although there are examples of successful permutation 
sites at the former locations [5-10]. The same considerations apply 
to the case where the binding mutation is near the N-terminus, 
except the duplicated segment begins with the N-terminus and ends 
at a surface loop C-terminal to the binding mutation (Fig. la). 

Inability to find a stable circular permutant (CP) is the most 
common failure point in the AFF protocol. Permutation almost 
always destabilizes a protein, and there is no reliable method for 
predicting the extent of destabilization for a given permutation site. 
The CP needs to be at least marginally stable (AG, yf14 > 2-3 kcal/mol), 
with more stable CPs requiring less optimization (see Subheading 5). 
Our approach for selecting permutation sites is to choose the first 
three to four surface loops either N-terminal or C-terminal to the 
binding mutation, depending on whether the binding mutation is 
closer to the C-terminus or N-terminus, respectively. Loops that are 
close to the binding/active site should be avoided for functional 
reasons, although xylanase [9] and beta lactamase [11] were per- 
muted at several loops proximal to their active sites without major 
loss of activity. Fortunately, all but the smallest POIs will have many 
loops from which to choose and at least one will usually be stable 
and functional enough for the AFF design. For example, RBP (277 
amino acids) has 13 surface loops (Fig. 2a). We created CPs at eight 
of these loops and all were stable, soluble, and functional. All were 
destabilized compared to wild-type (WT) RBP, however, and this 
finding demonstrates the advantage of starting with the most stable 
variant of the POI available. 


The linker functions to physically bridge the original N- and 
C-termini of the POI. It effectively becomes a new surface loop of 
the CP. As such, the amino acid sequence should be hydrophilic 
and flexible enough to not impose any new constraints on the 
protein structure. We base our linkers on Gly/Ala/Ser repeats, 
although more advanced design criteria have been discussed 
[9, 11-15]. With regard to linker length, a rule of thumb is to 
measure the N-to-C distance (C,-C,) from the structure “as the 
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Fig. 2 Examples of viable circular permutants for AFF and fragments for FREX. (a) Stable and functional CPs were 
generated by cleaving the RBP sequences at the eight surface loops centered around the positions indicated by 
black spheres, and joining the original termini by a Gly/Ala/Ser-based linker of 30 amino acids. The ribose ligand 
is shown as black sticks. (b) The Fn3-FREX sensor was created by duplicating residues 48 to the C-terminus 
(black segment. The binding mutation site (Tyr87) contacts the target ligand (SH2 domain, sticks) and the tuning 
mutation site (Ile75) packs against hydrophobic residues in the two gray beta strands shown 


3.4 Characterize CPs 


crow flies,” and calculate length using ~2.5 A per amino acid. For 
POIs in which the line-of-sight between termini is blocked by 
structure, we use a figure of <2.0 A per amino acid to account for 
the arc that the linker must take over the curved surface of the 
protein. For example, RBP has an N-to-C distance of ~40 A (Fig. 2a), 
which we spanned with a linker consisting of 30 amino acids [4]. 
If the N-to-C distance is not known precisely, it is best to err on 
the side of length, as we have found that using linkers longer than 
necessary does not dramatically destabilize the CP, in contrast to 
using linkers that are too short [16]. 


At this stage it is important to express and purify candidate CPs. 
CPs that exhibit degradation, aggregation, or loss of function 
should be rejected. The CPs are purified and their relative thermo- 
dynamic stabilities (as well as that of the POI containing binding 
mutation) are determined using chemical or thermal denaturation 
techniques [17] (see Subheadings 3.4.1 and 3.4.2). The most sta- 
ble CP is then selected for AFF gene construction (Subheading 4). 
As a final note, fluorophores are typically introduced into the AFF 
protein by means of two Cys residues introduced at the N-terminus 
and in the surface loop of the N-fold selected as the permutation 
site. Although these Cys do not generally affect structure or stabil- 
ity of WT or CP forms of the POI, it is prudent to incorporate 
them into the constructs at this point to most accurately represent 
the Nand N’ folds of POI-AFF. 


3.4.1 Obtaining Stability 
Parameters from Chemical 
Denaturation Curves 

with Urea or Guanidine 
Hydrochlorie (GdnHCI) 


1. 


2. 
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Prepare solution A and solution B of the desired buffer, pH, 
salt, etc. The two solutions are made identically except solid, 
ultrapure urea (final concentration of 8 M) or GdnHCl 
(final concentration of 6 M) is added to solution B prior to the 
aliquots of stock buffer, salt, etc. 


Add identical aliquots of concentrated protein to solution A 
and solution B. The final protein concentration depends on 
the instrumentation used to monitor unfolding, but typical 
concentrations for fluorescence and circular dichroism (CD) 
are 1-20 pM. 


. Prepare at least 25 samples consisting of evenly spaced mix- 


tures (in denaturant concentration) of solution A and solution 
B. For example, sample #1 is 100% solution A, sample #25 is 
100% solution B, and samples #2—24 contain linearly increas- 
ing concentrations of denaturant. Use of a two-syringe 
Hamilton dilutor or a manual repeating pipet is recommended 
to minimize denaturant concentration error. Incubate samples 
at the desired temperature until equilibrium is reached (typi- 
cally >2 h). 


. Scan samples using the instrument of choice (UV/Vis spectro- 


photometer, fluorimeter, CD spectrophotometer) at the desired 
wavelength. The observed signal (6,,,) follows a sigmoidal 
curve, as shown in Fig. 3 (left). Fit @,,; to the linear extrapola- 
tion equation (Eq. 1): 


0 (6, +s,{D]+ (0, + sy [D])(exp(AG,, ya - m[D]))] / (1+ (exp(AG aus - m[D])) (1) 


3.4.2 Obtaining Stability 
Parameters from Thermal 
Denaturation Curves 


1. 


where [ D] is denaturant concentration and parameters 0x 
and @y are the signals of the native and unfolded forms of the 
protein at zero denaturant concentration, sy and sy are the 
slopes of the native and unfolded baselines, A Gig is the stabil- 
ity of the protein at zero denaturant concentration, and m is 
the cooperativity parameter. The midpoint of chemical dena- 
turation (C3 equal to AGypoig/m) is a particularly useful 
parameter for ranking the relative stabilities of related protein 
variants with similar m-values (e.g., the POI and mutants 
thereof), because it is more accurate and reproducible than 
A Gingold: 


Prepare a solution of protein in the desired buffer. For CD- 
monitored denaturation, one should generally use dilute pro- 
tein solutions (1-5 pM) and a long path length cuvette (1 cm) 
to minimize aggregation at higher temperatures. 


. To establish native and unfolded baselines, start the melt at a 


temperature at least 10 °C below the beginning of the unfold- 
ing transition, and continue the melt at least 10 °C after the 
transition is 90% complete. After the melt is finished, let the 
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Fig. 3 Simulated chemical (/eff, and thermal (right) denaturation curves illustrating how C,, and 7,, values are 
calculated and interpolated, respectively. Lines are best fits of the data to the linear extrapolation equation 
(left) and to linear portions of the native and unfolded baselines (right) 


3.5 Modifications 
to Step 1 for FREX 


sample equilibrate at the starting temperature and record the 
CD signal. Thermal denaturation is considered reversible if the 
signal returns to >90% of its original value. 


3. One can obtain AG;j4 and melting temperature (Ti) parame- 
ters by fitting 6,,; to the v’ant Hoff equation (17) if denatur- 
ation is reversible. However, since thermal transitions are often 
irreversible, and because we are mostly interested in the rela- 
tive stabilities of closely related protein variants, it is sufficient 
to report apparent T,,, values by interpolating the midpoints of 
melting curves obtained under identical solution and heating 
conditions. To do so, fit the linear portions of the native and 
unfolded baselines as shown in Fig. 3 (right) and calculate Oy 
and Oy at each experimental temperature. Interpolate the tem- 
perature at which 0y—Oovs = Oovs— On. 


The steps described in Subheadings 3.1 and 3.2 are identical in the 
FREX protocol. Just as there is no method for predicting CP 
stability as a function of permutation site, there is no means for 
predicting the affinity of two complementary protein fragments 
based on the location of the cleavage site. We therefore apply the 
same criteria in choosing the binding mutation and duplicate seg- 
ment for FREX. The steps described in Subheadings 3.3 and 3.4 
do not apply to FREX. At this design stage, however, one should 
identify a site in the POI for making tuning mutations. A tuning 
mutation is often needed for AFF and will always be necessary for 
FREX, since the duplicate fragment will not form a complex with 
the POI (at reasonable fragment concentrations) unless a 
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structural defect is introduced into the POI which is then “swapped 
out” by fragment binding. A reliable tuning mutation site consists 
of a large, buried hydrophobic residue (Leu, Ie, Phe, or Tyr) that 
is in the duplicated portion of the POI and packs against hydro- 
phobic residues in the unduplicated portion (Fig. 2b). A single 
tuning site should (and usually can) be selected that will service all 
fragment lengths to be tested. Tuning consists of substituting 
progressively smaller hydrophobic residues (Val, Ala, Gly, and for 
extreme destabilization, a charged residue) into that site to pro- 
gressively destabilize the POI. 


4 Step 2 of AFF Protocol: Gene Construction 


4.1 N-frame 
Half-Gene 


4.2 \N-frame 
Half-Gene 


AFF genes are generated by constructing two half-genes corre- 
sponding to the N- and N’-frames, mutating them as needed, and 
then fusing them together. Keeping the frames separate is neces- 
sary because mutagenic PCR primers will bind to two locations in 
the full-length AFF gene due to sequence duplication. This redun- 
dancy also makes it difficult to obtain fully synthetic AFF genes 
from commercial sources. 

A Hise_-s sequence or other purification/expression tag can be 
introduced at this point. The tag should be expressed at the terminus 
of the protein that comprises the N’-fold, i.e., at the N-terminus of 
POI-AFF if a C-terminal fragment was duplicated, or at the 
C-terminus ifan N-terminal fragment was duplicated. This arrange- 
ment will help select against purifying degradation products, 
because if a cellular protease cleaves POI-AFF it will likely do so by 
attacking the less stable N’-fold. The following steps describe 
construction of an AFF-POI in which a C-terminal fragment is 
duplicated and appended to the N-terminus; analogous steps are 
taken for an AFF-POI with a duplicated N-terminal fragment. 


Introduce binding, Cys, and tuning mutations into the WT POI 
gene as dictated in Subheading 3. Amplify the N-frame half-gene 
by annealing primer 1 (containing a stop codon) to the 3’-end of 
the above-modified POI gene, and primer 2 to the 5’-end 
(Fig. 4a). Primer 2 encodes for the C-terminal half of the desired 
linker and bears a restriction site of choice at its 5'-end. The restric- 
tion site necessarily becomes incorporated into the linker and for 
this reason we favor the NotI restriction site because it encodes for 
Ala-Ala-Ala. 


Introduce an N’-frame tuning mutation into the WT POI gene if 
necessary. Design primer 3 such that it binds the POI gene at a 
position that defines the start of the duplicate polypeptide segment 
(and the N-terminus of POI-AFF) (Fig. 4a). Primer 3 begins with 
a codon for Cys. Primer 4 anneals to the 3’-end of the POI gene, 
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encodes for the N-terminal half of the linker, and ends with the 
same restriction site as in Subheading 4.1. PCRamplify to generate 
the N’-frame half-gene. Digest the two half-genes with the restric- 
tion enzyme. The products are now ready to be ligated and sub- 
cloned into an expression vector of choice (Fig. 4b). 


4.3 Alternate The amino acids imposed by the restriction site usually have little 
Method: Fusion PCR effect on the properties of the linker, especially when the linker is 
long. For short linkers or those that must be composed of a par- 
ticular sequence (e.g., protease site), these leftover residues may be 
undesirable. The alternate method allows one to eliminate the 
restriction site signature altogether. Construct the N-frame PCR 
template as in Subheading 4.1 except use primer 5 instead of 
primer 2 (Fig. 4a). Primer 5 contains enough of the linker sequence 
to overlap with primer 6 by at least 20 base pairs; this can corre- 
spond to the full or partial linker sequence depending on linker 
length. Primer 5 does not contain a restriction site. Create the N’- 
frame template as in Subheading 4.2 except use primer 6 instead of 
primer 4. Primer 6 binds to the 3’-end of the POI gene and ends 
with a full or partial linker sequence, complementary to that in 
primer 5 with at least a 20 base pair overlap, again without a restric- 
tion site. PCR amplify the N’-frame template by annealing primer 
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site binding site 
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xX 
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Cc 
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| 


fusion PCR using above products as templates and 1 & 3 as primers 


Fig. 4 Cloning strategy for constructing AFF genes. (a) Annealing sites and compositions of primers 1-6 are 
indicated below the POI gene. The restriction enzyme digestion/ligation and fusion PCR methods for connect- 
ing the Nand NV half-genes are shown in (b, c), respectively 
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3 and primer 6 to the POI gene. Mix the N-frame template, N’-frame 
template, primer 1, and primer 3, and generate the full-length AFF 
gene using the overlap extension PCR method (Fig. 4c) [18]. 
Primer 1 and primer 3 will each anneal to a second, undesirable site 
within the N’-frame and N-frame template, respectively, and this 
results in shorter PCR products. Purify the longest PCR product 
by agarose gel and subclone into the expression vector. 


The cloning steps for creating FREX constructs are simpler, since 
the duplicated DNA sequences are never physically joined and no 
linker is present. At this stage, we find it useful to fuse the gene 
of a donor and acceptor FP to the either end of the POI and frag- 
ment genes. In addition to providing a direct binding assay for 
sensor tuning (Subheading 5), an FP serves as a carrier protein to 
help the fragment express well, resist degradation, and stay solu- 
ble in cells. 


5 Step 3 of AFF Protocol: Optimization 


5.1 Binding Positive 
Control 


The objective of this step is to tune the thermodynamics of the 
sensor so that it is mainly in the N-form in the absence of ligand, 
and switches to the N’-form upon ligand binding. For the sake of 
discussion, we assume the most common outcome of Subheading 
3, Le., that the CP form of the POI was found to be less stable than 
the WT POI, and that the binding mutation was introduced into 
the N-frame. The ideal distribution of N: N’ populations in the 
absence of ligand is ~10:1. This scenario, which corresponds to the 
N-fold being ~1.4 kcal/mol more stable than the N’-fold, achieves 
an optimal balance of near-maximum fluorescence signal change 
and minimal reduction of observed ligand-binding affinity!. Which 
tuning mutation (if any) needs to be made to achieve this balance, 
and into which frame it should be placed, can be predicted from 
the results of Subheading 3.4. Introducing this mutation into 
POI-AFF during Subheading 4 can save time by requiring less sub- 
sequent optimization in Subheading 5. 


A common problem with POI-AFFs prior to their thermodynamic 
balancing is that the N-fold is so much more stable than the N’- 
fold that the ligand binds weakly or not at all. This snag is likely to 
be encountered if the POI was found in Subheading 3.4 to be 
much more stable than the CP (e.g. AT, > 10 °C or 
AAGunfora > 3 kcal/mol; see Subheadings 3.4.1 and 3.4.2). We 


Because a portion of the binding energy is used to drive the N> N’ conformational change, the observed 
K, will be greater than the intrinsic Ky of the POI by a factor of (1 + Ky+Ky-)/Ky, where K= exp (-A Guntoig/ 
RT) for the respective N- and N’-folds [3]. 
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5.2 Stability Tuning 


therefore recommend that one initially perform a positive binding 
test using saturating levels of ligand by means of a rapid and not 
necessarily quantitative assay including size exclusion chromatog- 
raphy, affinity pull-down, or shift in 7}, or C,. If no binding is 
observed, then one or more destabilizing mutations are introduced 
into the N-fold, e.g., substituting Gly, Ala, or Val at one or more 
hydrophobic packing sites identified in Subheading 3. A positive 
binding result indicates that the protein has adopted the N’-fold in 
the presence of ligand but does not reveal whether it had switched 
to the N’-fold from the N-fold, or was already in the N’-fold in the 
absence of ligand. 


Once a binding interaction has been established, the final step is to 
optimize the sensor for maximum fluorescence response and bind- 
ing affinity. Fluorescent donor and acceptor groups are attached to 
POI-AFF at its N-terminus and in the permutation loop of the 
N-frame. We favor attaching maleimide dyes to engineered Cys 
residues, although there are a variety of alternate chemistries and 
labeling strategies from which to choose’. The important consider- 
ations are that one obtains close to a 1:1 donor:acceptor ratio and 
that the chosen dyes are sensitive to short-range distance changes 
(short Forster radii for FRET), as they will be close enough to be 
in contact in the N’-fold and of variable distance apart in the 
N-fold, depending on the length and residual structure of the 
duplicated segment. To achieve 1:1 donor:acceptor labeling one 
may have to experiment with different ratios of dyes in the labeling 
step. We have also obtained satisfactory results by labeling both 
positions with a single fluorophore (pyrene excimer formation and 
BODIPY-FL self-quenching). 

To evaluate sensor performance, increasing amounts of ligand 
are added to a fixed concentration of fluorescently labeled sensor. 
Fitting the observed fluorescence (0,,;) to the one-site binding 
equation (Eq. 2) yields the dissociation constant (Kg) and fluores- 
cence change (A@=Osound—Orree) aS the parameters. Ly and Py are 
total concentrations of ligand and protein. 


1/2 


Bois = sound + (von ~ 8x ){(Le P, K,) ((L, | P, | Ki) 4-2, -K,)} / (2-P,) (2) 


The three possible outcomes are that a binding curve: (i) is 
obtained with a fitted K, value close to that of WIT POT; (ii) is 
obtained with a fitted Ky significantly larger than that of WT POI; 
and (iii) cannot be generated because no fluorescence change is 
observed. Outcome (i) indicates that no further optimization is 


* Existing AFF sensors have thus far made exclusive use of chemical fluorophores rather than genetically 
encoded FPs. The reason is that inserting an FP into the permutation loop of the N-frame would likely 
destabilize the N-fold and necessitate extensive thermodynamic rebalancing. It may be possible, however, 
to insert a CP form of the FP (in which the close proximity of its N- and C-termini would be less perturb- 
ing to the POI), or to move the FP from the permutant loop to the C-terminus of POI-AFF. 


5.3 Optional: 
Kinetic Tuning 
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necessary and the sensor is ready for use. Result (ii) suggests that the 
N-fold is too stable and that a moderately destabilizing mutation 
should be introduced into the N-frame. If A@ is close to zero 
(outcome (iii)) then one can conclude that POI-AFF either never 
switched conformation (i.e., it was in the N’-fold even in the 
absence of ligand), or that it changed conformation and the fluo- 
rophores did not report on the change. In the former scenario the 
solution is to introduce tuning mutations into the N’-frame 
instead of the N-frame. The expectation is that A@ will initially 
increase and then approach a maximum value as the N’-fold is 
progressively destabilized by packing mutations of increasing 
severity [3]. Any one of these variants can be used for sensing, 
although for the typical case in which a balance between high A@ 
and low Ky is desired, the mutant that yields outcome (i) is selected 
as the final optimized product. 


The AFF sensing mechanism is reversible and the speed at which 
the forward and reverse conformational changes occur determines 
how rapidly the sensor can detect fluctuations in analyte concentra- 
tion. It may be desirable in some applications to increase this 
response rate. For calbindin Do, and RBP, the rate limiting step of 
the N>N’ and N’>N reactions appears to involve a local but not 
global unfolding event, most likely unfolding/dissociation of the 
copy of the duplicate segment that is docked to the shared segment 
of the protein. If this result is general, any mutation that destabi- 
lizes the interaction between the shared and duplicate regions has 
the potential to accelerate the conformational switch. If one wishes 
to attempt to engineer a faster AFF sensor, we recommend first 
following all of the thermodynamic balancing procedures described 
above, then introducing kinetic tuning mutation(s) in a manner 
that does not perturb the existing N/N’ balance or ligand-binding 
affinity. For example, consider a hypothetical 200-amino acid POI 
from which residues 150-200 were duplicated to generate POI-AFF. 
Further suppose that residues Phe130 and Leul70 pack tightly 
against each other. Kinetic tuning could consist of a single 
Phe130—Ala substitution in the shared region, or Leul70—Ala 
+ Leul70’—Ala mutations in the respective duplicate segments. 
In either case the N-fold and N’-fold are in principle destabilized 
equally but the kinetic barrier to their interconversion may be 
lowered, depending on the extent to which the Phe130-Leu170/ 
Leu170’ interaction is formed in the transition state ensemble. 


As noted, it is simplest to fuse donor and acceptor FPs to the full- 
length POI and duplicate fragment at the cloning stage; however, if 
chemical fluorophores are desired then the labeling procedure is 
identical to that of AFF. One difference of FREX is that the two com- 
ponents of the sensor are on average very far apart in the absence of 
ligand (at reasonable protein concentrations), so donor/acceptor 
pairs with a wide range of Forster distances can be considered. 
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Thermodynamic tuning for FREX consists of binding experi- 
ments in which ligand is added to equimolar concentrations of 
fluorescently tagged full-length POI and fragment. For example, 
we optimized the Fn3-FREX sensor by screening the I75V, 175A, 
and I75G tuning variants of the POI in the background of the 
Y87A binding mutation (Fig. 2b). Fluorescence data are fit to Eq. 
2. Our simulations have shown that it is possible, although highly 
unlikely, that a ternary complex will form in the absence of a pack- 
ing mutation, as the entropic barrier for intermolecular folding is 
usually too large to be overcome by ligand-binding energy alone 
[3]. A more realistic concern is that the binary complex of full- 
length POI and fragment will form without ligand present. This 
occurs when the full-length POI is “overbalanced,” 1.e., so desta- 
bilized by the tuning mutation that restoration of the native pack- 
ing interaction by the fragment drives folding in the absence of 
ligand binding. It is not uncommon to span these extremes of sta- 
bility by placing a set of packing mutations at a single well-chosen 
site. A nicely balanced FREX construct is characterized by unusually 
high A@ (achieved by reducing FRET in the free state to near zero) 
and K, similar to that of the WT POI. This condition was met in 
the Fn3-FREX sensor by the I75A tuning mutation. 


6 Concluding Remarks 
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Chapter 3 


Construction of Protein Switches by Domain Insertion 
and Directed Evolution 


Lucas F. Ribeiro, Tiana D. Warren, and Marc Ostermeier 


Abstract 


A protein switch is a protein that changes between inactive (“off”) and active (“on”) states in response to 
a biomolecule or physical signal. These switches can be created by fusing two domains in such a way that 
the activity of the output domain is regulated by the input domain’s recognition of an input signal (such 
as the binding of a molecule, recognition of light). Here, we describe several methods for randomly fusing 
two domains to create domain insertion libraries from which protein switches can be identified by selections 
and/or screens. 


Key words Protein switch, Domain insertion, Circular permutation, Directed evolution 


1. Introduction 


Biological systems are often described as complex circuits consisting 
of an interacting network of molecules. A key component of these 
networks are protein switches that couple cellular functions. These 
switches change their active state (output) in response to a physical 
signal or binding event (input). The ability to create switches by 
linking any desired input and output domains would enable the 
rewiring of cellular circuitry to suit a researcher’s objective. This 
rewiring has numerous potential applications such as cancer thera- 
peutics [1], fluorescent biosensors [2-4], biomass degradation [5], 
and regulators of cell signaling elements [6]. 

Our directed evolution strategy for constructing protein 
switches involves fusion by domain insertion between two differ- 
ent proteins with the prerequisite input and output functions such 
that the behavior of the output domain is responsive to signal 
detection (e.g., a binding event) in the input domain (see Fig. 1). 
The exact fusion to construct to create a switch can be difficult to 
predict. Thus, a directed evolution approach is preferred in which 
an insert gene is randomly inserted into an acceptor gene, to create 
a library of gene fusions encoding fusion proteins. The insert gene 
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Fig. 1 Schematic depiction of the creation of protein switches by domain insertion. A protein switch is a fusion of 
two domains (by domain insertion) in such a way that the activity of the output domain is regulated by the 
input domain’s recognition of an input signal. (a) DNA sequences are depicted as /ines and (b) their corre- 
sponding proteins as geometric shapes. A light gray color of the output domain indicates that the domain is inac- 
tive or less active. The signal that modulates the switch is depicted as a black triangle 


can encode for the input or output domain. These fusion proteins 
are then subjected to selections and/or screens to find a chimeric 
protein that possesses signal-dependent behavior. Using this strat- 
egy our lab has produced several examples of ligand-activated pro- 
tein switches [1, 5, 7-18]. In one example, we created a 
ligand-activated enzyme with a 600-fold change in enzyme activ- 
ity in response ligand binding [7]. 

One of the major steps in constructing protein switches by 
directed evolution is creating diversity in the ways you fuse the two 
domains. Random insertion libraries can be created using DNase I 
or S1 nuclease to generate a single double-stranded break throughout 
the plasmid that encodes the acceptor gene [7, 10, 12] (see Fig. 2a). 
However, some drawbacks of using these nuclease-digestion meth- 
ods include the generation of insertions outside the gene-coding 
sequence and out-of-frame or inverted insertions, resulting in a 
significant fraction of the library encoding undesired and nonfunc- 
tional library members. Therefore, this strategy is best combined 
with a robust screening/selection system. In comparison to these 
nuclease methods, multiplex inverse PCR affords much greater 
control over the library composition. In inverse PCR, abutting 
primers are designed to “open up” a plasmid to facilitate domain 
insertion (see Fig. 2b). A separate inverse PCR reaction is per- 
formed for each desired insertion site, each of which requires a 
unique set of primers (see Fig. 2d). One feature of this method is 
the ability to use it for both random domain insertion (i.e., inverse 
PCR at each codon) or for insertions at sites chosen by rational 
design (i.e., achieved by inverse PCR only at designed positions). 

Another layer of diversity can be created by circular permutation 
of the insert gene, thus increasing the overall diversity of the 
protein switch library. Conceptually, circular permutation is chang- 
ing the intramolecular order of amino acids. Circular permutation 
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Fig. 2 Schematic of the methods to create random domain insertion libraries, and circularly permuted insert 
genes. (a) The plasmid DNA containing the acceptor gene can be randomly linearized using DNase | or $1 
nuclease. (b) The plasmid can be linearized at targeted positions in the acceptor gene by multiplex inverse 
PCR. (c) Inverse PCR on a fused gene duplication is used to make a library of circularly permuted inserts. (d) 
Scheme of primers for multiplex inverse PCR. Three forward and reverse primers are shown for the first three 
codons of a gene. Different forward and reverse primer pairs can be used in PCR to create codon deletions or 
duplications. For example, using 1F and 2R duplicates codon 2 at either end of the PCR product. Using primers 
3F and 2R results in a PCR product that lacks codon 3 


is performed such that the original N- and C-termini of the protein 
is linked by peptide linker of appropriate length and “opened up” 
at a different point in the gene to yield new N- and C-termini at 
which the insert gene can be inserted into the acceptor gene. 
Previous studies suggest that the sequence space accessed by com- 
bining circular permutation with random domain insertion is rich 
in active switches with very large differences in activity between 
their “on” and “off” states [7, 14]. Circular permutation libraries 
can be created by PCR on a plasmid containing a tandem repeat of 
the gene joined by DNA encoding the linker between the N- and 
C-termini (see Fig. 2c). The tandem repeat lacks a stop codon in 
the first repeat. 

The PCR reactions for preparing the acceptor insertion sites 
(see Fig. 2b) and preparing the circularly permuted insert (see Fig. 2c) 
offer an opportunity for two types of additional diversity. First, 
forward and reverse primers can be combined in different combi- 
nations to delete or duplicate a codon (see Fig. 2d). Second, DNA 
encoding linkers of different lengths and compositions (both variable 
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2 Materials 


linkers and linkers of specific sequences) can be appended to the 5’ 
end of the primers for the inverse PCR reactions. These linkers 
between the two domains can be important for the switching prop- 
erty [5, 15, 16]; therefore, variable linkers and fixed composition 
linkers are often tested. 


oN DA 7 be 


. QIAPrep Spin Miniprep Kit (Qiagen). 
. Invitrogen PureLink Gel Extraction Kit (Invitrogen). 
. DNA Clean and Concentrator Kit (Zymo Research Corp., 


Irvine, CA). 


. DNase I (Promega). 

. 0.5 M Tris-HCl pH 7.5. 

. 50% glycerol. 

. 10 mM MnCl. 

. 20 mg/mL BSA solution (100x stock, e.g., New England 


Biolabs). 


. 1M EDTA. 
. T4 DNA Polymerase (e.g., New England Biolabs, Ipswitch, 


MA). 


. T4 DNA Ligase (2,000,000 U/mL) (e.g., New England 


Biolabs, Ipswitch, MA). 


. 10 mM dNTPs each. 
. Antarctic Phosphatase (e.g., New England Biolabs, 


Ipswitch, MA). 


. Sl Nuclease (e.g., Promega, Madison, WI). 
. 10x S1 Nuclease buffer (500 mM sodium acetate, 2800 mM 


NaCl, 45 mM ZnSOu, pH 4.5). 


. Phusion High-Fidelity PCR Master Mix. 
. 1x TAE buffer: 40 mM Tris-base, 20 mM Acetic Acid, 1 mM 


EDTA, pH 7.6. 


. Nanodrop. 


. Gene Pulser Electroporation Cuvettes (Biorad, Hercules, 


California). 


. SOC medium: 2% Tryptone, 0.5% Yeast Extract, 10 mM NaCl, 


2.5 mM KCl, 10 mM MgCh, 10 mM MgSO,, 20 mM Glucose. 


. 245x245 mm Square Bioassay Dish (Corning, New York, 


NY). 


. LB Agar:10% Tryptone, 5% Yeast Extract, 10% NaCl, and 15 g 


Bacto Agar per 1 L of medium. 


3 Methods 


3.1. Preparing 
Acceptor Vector Using 
DNase | 


10. 


ll. 


Engineering Protein Switches by Domain Insertion 47 


. Prepare 100 pg of plasmid DNA containing the gene encoding 


the acceptor domain (see Note 1). 


. Prepare a 50 pL working solution of DNase I (1 U/mL) ina 


0.5 mL microcentrifuge tube by combining 50 U DNase I, 
25 mM Tris-HCl pH 7.5 and 50% glycerol. Store at —20 °C 
(see Note 2). 


. Prepare 1.2 mL of diluent solution in a 1.5 mL microcentri- 


fuge tube with a final concentration of 50 mM Tris-HCl pH 
7.5, 1 mM MnCl, and 0.5x of BSA. Vortex to mix and store at 
room temperature (see Note 3). 


. Before creating a library, determine the concentration of DNase 


I that will produce the most DNA vectors with a single double- 
stranded break. To determine this optimum concentration, 
combine 20 pg of plasmid DNA and 380 pL of the diluent solu- 
tion in a 1.5 mL microcentrifuge tube. Mix and centrifuge 
briefly to collect all the liquid at the bottom of the tube. Aliquot 
95 wL each into four 0.5 mL microcentrifuge tubes and incu- 
bate these tubes at 22 °C for 10 min. 


. After the incubation, add 5 pL of diluted DNase I to each tube 


at different concentrations (i.e., 1, 2, 4, and 8 mU) and incu- 
bate at 22 °C for 8 min. 


. To inactivate the DNase I, add 2.4 pL of 0.5 M EDTA to each 


tube and incubate at 75 °C for 10 min. 


. Purify each reaction using the Zymo DNA Clean and 


Concentrator kit or a PCR purification lat column. Elute with 
water. 


. Run 5-10 pL of each sample on a 0.8% agarose gel made with 


lx TAE buffer to determine the concentration of DNase I that 
yields the best digestion results (see Notes 4 and 5). 


. Once the optimal DNase I dilution is determined, prepare 


80 pg plasmid DNA in 1520 mL of diluent solution and aliquot 
95 pL into 16 microtubes (i.e., 5 pg DNA per tube). Incubate 
for 10 min at 22 °C. 


Add 5 pL of the correct DNase dilution to each tube in 30 s 
intervals (it will take 8 min to add DNase I to all tubes). After 
8 min, stop the reaction as described in step 6 (see Note 6). 


Combine all the reactions into a 15 mL Nalgene test tube and 
purify the DNA using four 25-yg columns from a Zymo DNA 
Clean and Concentrator Kit. Elute the DNA from each column 
in two 20 pL volumes of water pre-warmed to 65 °C. Combine 
the eluate from each spin column (160 pL total) (see Note 7). 
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12. Determine the DNA concentration using a NanoDrop and 
run 100 ng of DNA in an agarose gel to estimate the percent 
of linear DNA using a gel-imaging software (see Note 4). 


13. Once the concentration is known, proceed to the repair step 
(see Note 8). Prepare multiple reactions consisting of DNA, 
T4 DNA ligase (160 cohesive end units/pg linear DNA), T4 
DNA Polymerase (1 unit/pg linear DNA), 200 pM dNTPs, 
lx T4 DNA Ligase buffer, and 1x BSA in water. Each reaction 
should be <50 pL. Incubate at 12 °C for 20 min. 


14. Stop the reaction with 10 mM EDTA (final concentration) 
followed by incubation at 75 °C for 10 min. 


15. Purify the DNA using a 25-"g Zymo DNA Clean and 
Concentrator Kit. 


16. Separate the linear DNA from the supercoiled and nicked 
DNA via DNA gel electrophoresis. Run the gel at 90 V on a 
0.8% agarose gel made with 1x TAE buffer for approximately 
1.5 h or until the DNA is separated into three distinct bands 
(see Note 9). 


17. Excise the band containing the linear DNA and purify using 
Invitrogen’s Pure Link Gel Extraction Kit according the man- 
ufacturer’s instructions. 


18. Dephosphorylate the repaired linear plasmid using Antarctic 
Phosphatase. For every microgram of DNA, add 20 units of 
enzyme. Incubate for 1 h at 37 °C. 


19. Heat inactivate the enzyme by incubating the reaction at 65 °C 


for 10 min. 
3.2 Preparing 1. Prepare 50 pg of plasmid DNA containing the gene encoding 
Acceptor Vector Using the acceptor domain (see Note 1). 
S1 Nuclease 2. Ina 1.5 mL microcentrifuge tube, add 26 pg of plasmid DNA, 


1x S1 Nuclease buffer, and 125 units of S1 Nuclease to a final 
volume of 325 pL. Aliquot 25 pL each into 13 microtubes 
(1.e., 2 pg DNA per tube). Incubate at 37 °C for 20 min. 


3. Combine all the reactions and purify using two 25-1g spin col- 
umns of a Zymo DNA Clean and Concentrator Kit. Elute the 
DNA from each column into two 20 pL aliquots of water pre- 
warmed to 65 °C. Combine the eluate from each spin column 
(80 pL total) (see Note 7). 


4. Determine the DNA concentration using a NanoDrop and elec- 
trophorese 100 ng of sample to estimate the percentage of linear 
DNA using gel-imaging software (see Note 4). 


5. Once the concentration is known, proceed to the repair, gel 
purification, and dephosphorylation steps (see Subheading 3.1, 
steps 13-19). 


3.3 Preparing 
Acceptor Vector Using 
Multiplex Inverse PCR 


2. 


10. 


11. 


12. 


13. 
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. Design one set of primers for each insertion site (see Fig. 2c 


and Note 10). 


Purify the plasmid using the Qiagen Plasmid Miniprep Kit 
following the manufacturer’s instructions. 


. Measure the concentration of the DNA using a NanoDrop 


(see Note 11). 


. Dilute the template DNA to 10 ng/pL in DNase-free water. 


You will need 1 pL per PCR reaction. 


. Prepare 10 uM primer mixes in a 96-well plate (one primer mix 


for each primer pair). To each well, add 10 pL of 100 uM for- 
ward primer, 10 pL of 100 pM reverse primer, and 80 pL of 
water to bring the primer mix to 100 pL total (see Note 12). 


. Prepare a dilute DNA solution by mixing the template DNA 


and DNase-free water in a 1.5 mL microcentrifuge tube. For 
every PCR reaction, add 8 pL of water and 1 pL of the diluted 
DNA stock solution. A separate PCR reaction is required for 
each primer mix. 


. To each well of a 96-well PCR plate, add 9 pL of the diluted 


DNA solution, 1 pL of primer mix, and 10 pL of Phusion 2x 
Master Mix. Decrease the volume setting on the pipette and 
mix by pipetting up and down several times (see Note 13). 


. Centrifuge for 30 s at 720xy to collect all reaction compo- 


nents at the bottom of the wells. 


. Use the following guidelines to design your amplification pro- 


gram (see Note 14): 
e 1 cycle: 98 °C for 2.5 min. 


e 25 cycles: (1) 98 °C for 30 s, (2) T,, for 30 s, and (3) 
72 °C for 30 s/kilobases of your vector. 


¢ 1 cycle: 72 °C for 10 min. 


Analyze 5 pL from 20 or more random samples using DNA gel 
electrophoresis with a 0.8% agarose gel at 90 V for 40 min to 
confirm successful amplification of most reactions. 


After confirmation, pool together 5 pL from each PCR reac- 
tion. Using DNA electrophoresis and a 0.8% agarose gel, elec- 
trophorese your sample at 90 V for 1 h to allow separation of 
the template DNA and PCR product. 

Excise the PCR product from the gel and purify using 
Invitrogen’s PureLink Gel Extraction Kit. 

Check the concentration and quality of the DNA using a 
Nanodrop. If the 260/280 and 260/230 values are too low, 


purify the DNA sample using the Zymo Clean and Concentrate 
Kit and elute in 20 pL of DNase-free water (see Note 11). 
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3.4 Preparing 
Circular Permuted 
Insert 


3.5 Ligation 
of the Insert 
and Acceptor DNA 


3.6 Transformation 


1. 


. Construct a vector as shown in Fig. 2c by duplicating the insert 


gene by end-to-end fusion, connecting them with a linker (see 
Notes 15-17). 


. Design a forward primer starting at the first base of the selected 


codon and a reverse primer starting at the last base of the previous 
codon (the melting temperatures should be between 60 and 
65 °C (see Notes 10 and 18-20). 


. The PCR protocol is the same as used for multiplex inverse 


PCR (see Subheading 3.3, steps 2-13). 


Starting with 200 ng of prepared vector, calculate the pmol of 
vector using the following formula: 


pmol = (mg of DNA) / (kilobase pairs of vector x 650 Da). 


. Using the same formula, calculate the pmol of insert required 


to achieve a 3:1 or 5:1 insert-to-vector molar ratio. 


. Add the calculated amount of prepared vector and insert to a 


thin-walled PCR tube. Add 2 pL of 10x T4 DNA Ligase Buffer 
and fill to 19 pL. Add 1 pL of T4 DNA Ligase, flick the tube 
to mix and centrifuge briefly to collect the reaction compo- 
nents at the bottom of the tube (see Notes 21 and 22). 


. Place the ligation reaction in a thermocycler. Incubate for 


12-18 h cycling between: 25 °C for 30 s and 10 °C for 30 s. 


. After the incubation, purify the DNA using the Zymo Clean & 


Concentrator Kit. Elute the DNA into a 1.5 mL microcentrifuge 
tube in 10 pL of water. Concentrate your sample to approximately 
4 pL using a vacufuge at room temperature (see Note 23). 


. Turn on the Gene Pulser. Adjust the voltage to 2.5 kV. 
. Place the following on ice: high-efficiency (>10° cfu/ppg DNA) 


electrocompetent cells, a 0.2 cm Gene Pulser Cuvette, a 
0.5 mL microcentrifuge tube and your purified and concen- 
trated ligation (see Note 24). 


. Once the cells are thawed, aliquot 40 pL of cells into the 


microcentrifuge tube with the ligation, ensuring the cells 
engulf the DNA sample. Mix well. 


4. Pipette the cells into the cuvette (see Note 25). 


. Place the cuvette in the Gene Pulser and slide into the cham- 


ber, ensuring that the cuvette clicks into place. 


. Simultaneously hold down the pulse buttons until you hear the 


beep. Then, release the buttons. 


7. Quickly, add 1 mL of SOC media to the cuvette. 


. Transfer the cell/SOC mix to a 1.5 mL microcentrifuge tube. 


4 Notes 
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. Incubate the sample at 37 °C for | h. 
. Plate dilutions of the cells on 10 cm dishes and plate the 


remainder on 245 x 245 mm plate bioassay dishes with LB 
agar (see Note 26). 


. Incubate the plates at 37 °C for 12-18 h. 


. The next day, count the number of colonies on the 10 cm dishes 


to determine the number of transformants (see Note 27). 


. You can further confirm the diversity of this “naive” domain 


insertion library by performing colony PCR on several transfor- 
mants and/or growing 10 mL LB cultures of samples, purifying 
their DNA and analyzing the DNA sequence (see Note 28). 


. We have found that the Qiagen Miniprep kits have better yield 


and less genomic DNA contamination compared to the 
Midiprep or Maxiprep kits. 


2. This working solution is good for at least 3 months. 


3. If chilled on ice, solids may fall out of solution. If this happens, 


a fresh diluent must be prepared. 


. The slowest migrating band is nicked open plasmid. The fast- 


est migrating band is supercoiled DNA and the band in the 
middle is linear DNA. 


. The optimal DNase I concentration is the one that produces a 


sharp band of linearized plasmid. To achieve this, 30-50% of 
the plasmid usually remains supercoiled. Overdigestion of the 
plasmid will result in smearing indicating deletions at the ends 
of the digestion products. 


. The 30 s staggered intervals are to insure that each tube will be 


stopped at exactly 8 min. We recommend that experiments 
be performed on the same day as the optimization. If the 
reactions will be completed on a different day, prepare fresh 
DNase dilutions. 


7. The prepared DNA can be stored at —20 °C. 


10. 


. This step creates blunt ends by filling in the end or by chewing 


away any overhanging DNA. Note the amount of T4 DNA 
Ligase and 'T4 DNA Polymerase should be calculated based on 
the amount of linear DNA. 


. Do not load more than 2.5 pg per lane (of the medium-sized 


lanes). If possible, do not use UV light or ethidium bromide as 
these can cause DNA damage. 


Your primers should adhere to the following guidelines as 
much as possible: (1) 40-60% GC, (2) at least 18 nucleotides 
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ll. 


12. 
13. 


14. 


15. 


16. 


in length, (3) the last two nucleotides on the 3’ each of primer 
should be Cs or Gs, and (4) there should be no more than 3 
Cs or Gs in the final five nucleotides of the 3’ end of the primer. 
All primers should be designed such that they have approxi- 
mately the same melting temperature (ideally 60-65 °C). The 5’ 
end of the forward primer should begin at the codon down- 
stream of the insertion site. The 5’ end of the reverse primer 
should begin at last nucleotide of the codon that is upstream of 
the insertion site. For example, if you want to insert a gene 
between codons 2 and 3 of the acceptor DNA, the 5’ end of 
your forward primer should begin at the first nucleotide in 
codon 3 and the 5’ end of your reverse primer should begin at 
the last nucleotide in codon 2. This will create an in-frame 
opening between codons 2 and 3 (see Fig. 2d). You can also 
duplicate and delete codons by shifting the primers upstream 
or downstream (see Fig. 2d). We have written a MATLAB 
script to design such primer pairs, which is freely available. 
After designing the primers, use a PCR simulation software 
such as AmplifX to determine the optimal annealing tempera- 
ture (7),.) for a few of the primer pairs. Use the lowest J, 
value of the primer pairs. Also use the software to check for the 
likelihood of nonspecific primer annealing to the template or 
the formation of primer dimers. Alternatively, you can subtract 
3-4 °C from the average melting temperature of all the primers 
as a good estimation for T),,.. 


Check the quality of your DNA. A reading with 260/280 
value of 1.8—2.0 and a 260/230 value of 1.8—2.3 will yield the 
best PCR results. 


You can scale down the volume of the primer mix if necessary. 


This process can be facilitated by using 12-channel basins and 
12-channel pipettes. 


You can minimize the formation of undesired PCR product by 
performing touch-down PCR using the following amplification 
program: (i) Preheat the thermocycler to 98 °C, (ii) 1 cycle: 
98°C for 3 min, (iii) 15 cycles: (1) 98 °C for 30 s, (2) 
Toor + 15 °C for 30 s, and (3) 72 °C for 30 s/kb of GOI. The 
annealing temperature should decrease by 1 °C for each cycle, 
(iv) 10 cycles: (1) 98 °C for 30 s, (2) TJ; for 30 s, and (3) 
72 °C for 30 s/kb of GOI, (v) 72 °C for 10 min. 


Circular permutation aims to change the position of N- and 
C-termini of the protein without disturbing its three- 
dimensional structure. In general, the N- and C-terminals of 
the original protein should be close enough to allow their union 
(<10 A) [19]. 

When the gene (without a stop codon) is already available in 
cloning vector and a convenient restriction site is available at 


17. 


18. 


19. 


20. 


21. 


22. 


23. 
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the end of that gene, you can PCR amplify the second copy of 
gene (with a stop codon) to add a suitable linker and directly 
ligate this amplified gene at the 3’ end of the first gene copy 
using the restriction site. When no restriction site is available, 
the genes can be fused together by overlap PCR. 


Linker connecting the original N- and C-termini: Linker 
length and composition can interfere with the folding, topology, 
and stability of the protein, so it must be rationally designed. 
Here are some considerations. (a) Length—the distance 
between the alpha carbons of the N- and C-termini residues 
can be determined using Pymol or similar molecular visualiza- 
tion software. Depending on the secondary structure of the 
linker residues, one amino acid typically spans a distance of 
1.5-3.5 A. Linkers that bridge distances greater than 10 A 
require more careful design. Depending on the situation, the 
protein’s surface properties must be taken into account. (b) 
Composition—Flexible backbones can help accommodate 
emerging conformational strains; thus, in most cases, glycine- 
rich (Gly) bridges are preferred. Typically, a GlySerGlyGly 
linker is sufficient to span a distance of 7-8 A. However, linkers 
of more diverse composition can be tested. 


Inter-domain linker: In order to alleviate possible disturbances 
caused by domain insertion, we usually add two residues to the 
new termini of the permuted protein. This can be done using 
degenerate primers with two codons of 5’- NNK-3’ at 5’ end of 
the forward and reverse primers. Note that the reverse primer 
must encode the reverse complement of 5’-( NNK),-3’. 


Reducing library size: For large proteins, purchasing compre- 
hensive sets of primers can be expensive. To decrease primer 
costs and the library size, you can create select circular permuta- 
tions that focus on residues that are solvent accessible, flexible, 
loosely packed, and between secondary structure elements. 


Phosphorylated primers can be ordered from most manufac- 
turers or unphosphorylated primers can be phosphorylated 
using T4 Polynucleotide Kinase (New England Biolabs, 
Ipswich, MA) following the manufacturer’s instructions. 
Alternatively, you can phosphorylate the PCR product. 


For the best results, do not exceed 10 ng of vector per pL of the 
ligation reaction. Note that the ligase used is high concentra- 
tion ligase (2,000,000 units/mL) for this blunt end ligation 
reaction. 


The ligation reaction can be scaled up. For the best results, 
prepare a master mix of the ligation and split it into 20 pL ali- 
quots for the incubation. 


Alternatively, you can elute in 6 pL of DNA but this may 
decrease your yield. 
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24. 


25% 


26. 


27. 


28. 


You can use any high-efficiency competent cells that suit your 
system. We recommend NEB 5-alpha cells. 


Check to make sure the cell/DNA sample reaches the bottom 
of the cuvette, that it does not have any bubbles and that it is 
making contact with both electrodes within the cuvette. 


Make one large 245 x 245 mm plate (approximately 250 mL 
of LB agar) and at least three 10 cm petri dishes of LB agar 
(approximately 25 mL of LB agar/plate). 


If you used multiplex inverse PCR to prepare your vector, you 
want the number of transformants to be >5 times the number of 
possible variants to have a high probability that the library con- 
tains all possible variants (assuming that each library member is 
expected to appear at the same frequency). For an in-depth 
discussion of this topic, see Bosley and Ostermeier [20]. 


Both DNase I and S1 Nuclease generate libraries with deletions 
(and occasionally duplications) distributed along the acceptor 
sequence. These deletions contribute to the sequence variability 
of the library, potentially generating relevant diversity for the 
creation of new properties in the chimeric proteins [11, 19]. 
Library construction using S1 is usually easier and results in 
smaller deletions since S1 nuclease digestion of plasmid DNA is 
halted after the first double-stranded break [10]. However, S1 
digestion can be heavily biased to occur in inverted repeats 


regions in the plasmid, if they are present [8]. 
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Peptide Switches 


Chapter 4 


Catalytic Amyloid Fibrils That Bind Copper 
to Activate Oxygen 


Alex Sternisha and Olga Makhlynets 


Abstract 


Amyloid-like fibrils assembled from de novo designed peptides lock ligands in a conformation optimal for 
metal binding and catalysis in a manner similar to how metalloenzymes provide proper coordination envi- 
ronment through fold. These supramolecular assemblies efficiently catalyze p-nitrophenyl ester hydrolysis 
in the presence of zinc and phenol oxidation by dioxygen in the presence of copper. The resulting hetero- 
geneous catalysts are inherently switchable, as addition and removal of the metal ions turns the catalytic 
activity on and off, respectively. The ease of peptide preparation and self-assembly makes amyloid-like 
fibrils an attractive platform for developing catalysts for a broad range of chemical reactions. Here, we present 
a detailed protocol for the preparation of copper-containing fibrils and for kinetic characterization of their 
abilities to oxidize phenols. 


Key words Peptides, Fibrils, Catalysis, Dioxygen activation, Phenol oxidation 


1. Introduction 


Metalloenzymes utilize metal ions to catalyze some of the most 
challenging chemical reactions, such as methane oxidation, lignin 
degradation, and nitrogen fixation [1-5]. Even more impressive is 
the fact that metalloenzymes are capable of performing these reac- 
tions in a very selective and stereospecific manner. The high effi- 
ciency and selectivity of natural metalloenzymes has inspired much 
effort to (re)design catalytic metalloproteins to improve selectivity 
for a substrate of choice and to catalyze novel chemical transforma- 
tions [6]. Many of the current state-of-the-art catalyst design strat- 
egies take advantage of both existing and de novo designed proteins 
to optimize the metal coordination sphere and to provide a bind- 
ing site for the substrate. Recently, this approach was taken to the 
next level by employing self-assembly of short peptides into fibrils. 
The fibrils provide stability and the appropriate coordination 
sphere for the metal ions necessary to facilitate chemical transfor- 
mations [7-9]. Amyloid-like fibrils, assembled from short 
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peptide fibrils 


(7-residues or less) de novo designed peptides, catalyze ester 
hydrolysis in the presence of Zn**. The most catalytically active 
peptide, Ac-IHIHIQI-CONH), was identified by varying hydro- 
phobic core residues and the sequence of hydrophilic portion of 
the peptide. This peptide self-assembles into fibrils and binds zinc, 
the resulting material catalyzes hydrolysis of p-nitrophenyl esters 
with efficiency comparable to that of the natural enzymes by weight 
[7]. Peptides that do not form fibrils, but have similar sequences 
otherwise, showed significantly lower catalytic activity. The corre- 
lation between the propensity to form fibrils and the ability to cata- 
lyze chemical reactions suggests that the rigid fibril arrangement of 
functional groups in peptide assemblies locks the metal ion in a 
coordination environment that promotes catalysis. The strategy of 
using peptide self-assembly for design of catalysts can be further 
expanded to other chemical transformations (Fig. 1). We recently 
applied this design strategy to create a catalyst for oxygen activa- 
tion [10]. We focused on Cu?* as a metal cofactor, because it is 
known to facilitate various redox reactions (e.g., C-H oxidation [11], 
epoxidation, etc. [12]), and is stable in an aqueous environment at 


peptide fibrils with Cu2* 


/ 


2+ 
Cu me) oO. 


DMP 


\ 


peptide fibrils with Cu’* 


pNPA 
NO, 
Zn?* 
2 °° 
peptide fibrils with Zn2* fe) 
+ —& © 
fe) 
NO, 


Fig. 1 Formation of fibrils creates a coordination sphere optimal for metal ion binding (Cu?* or Zn?*). These 
amyloid assemblies can catalyze ester hydrolysis and DMP oxidation 
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neutral pH. We investigated the ability of self-assembling supra- 
molecular catalysts to catalyze oxidative coupling of 
2,6-dimethoxyphenol (DMP), a well-characterized reaction that 
produces a bright orange product [13, 14]. Several highly reactive 
peptides that are capable to efficiently oxidize DMP using dioxy- 
gen as a cofactor were identified. Just like in the case of zinc- 
binding supramolecular catalysts of ester hydrolysis, the 
oxygen-activating ability strongly correlated with the propensity of 
peptides to form a f-sheet structure. 

Catalytic efficiency of the self-assembling peptides is impressive 
given the simplicity and adaptability of the design. A large number 
of short peptides can be screened in a high-throughput fashion. 
Additionally, mixing of different peptides prior to fibril formation 
provides access to a large number of different arrangements of 
functional groups in a single amyloid-like supramolecular assembly. 
Feasibility of this idea has been previously demonstrated: mixing 
peptides with different residues at position 6 produced fibrils more 
active than fibrils assembled from uniform peptides [7]. Simple 
variation of side chains in the hydrophilic positions of the peptide 
sequence allows for exploration of many different coordination 
arrangements that efficiently support the metal ion in all oxida- 
tion states required for catalysis [15]. Enzymes selectively recog- 
nize their substrates through multiple contacts with the protein’s 
side chains. This property could be potentially engineered into 
catalytic fibrils. 

Here, we provide a detailed protocol for robust preparation 
and kinetic characterization of supramolecular peptide assemblies 
that bind Cu** and catalyze phenol oxidation. The assay we devel- 
oped is based on the benchmarked reaction of DMP oxidation, 
often used to assess activity of laccase enzymes [16-19]. This assay 
provides a simple readout signal, is easy to perform, and is suitable 
for high-throughput screening in 96-well plates. We also present 
an optimized protocol for the preparation of heteropeptidic assem- 
blies, where peptides with different primary sequences are incorpo- 
rated into a single fibril. 


All solutions need to be prepared using MilliQ water, HPLC grade 
organic solvents, and analytical grade reagents. 


1. 1 M Hepes-KOH buffer, pH 8: Weigh out 9.53 g of Hepes 
free acid ina 50 mL Falcon tube. Add 30 mL of water to dissolve 
Hepes-free acid and adjust pH using 5 M KOH, and then add 
water to 40 mL (see Note 1). 

2.50 mM CuSO, solution in water: Weigh out 0.5 g of 
CuSO,*5H,O and add water to 40 mL. The final concentration 
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3 Methods 


3.1 Preparation 
of Peptide Stock 
Solution at pH 2 


of this solution should be confirmed using ICP or by measuring 
absorbance of Cu-Zincon complex (see Note 2). 


. 10 mM HCI (see Note 3). 


4. 80 mM 2,6-dimethoxyphenol (DMP): weigh out 20-30 mg of 


0 ON BD 


DMP solid is a microcentrifuge tube, add 1 mL of isopropanol, 
and invert the tube to dissolve the solid. Further dilute the 
resulting solution using isopropanol to a final concentration of 
80 mM (see Note 4). 


. 8 M urea solution in water: weigh out 2.4 g of urea, then add 


water to 5 mL. 


. Isopropanol. 

. 96-well plates. 

. Multichannel pipette. 

. UV-Vis spectrophotometer. 
10. 


Plate reader. 


Peptides were synthesized by manual Fmoc solid-phase synthesis at 
elevated temperature using Rink Amide or Wang resin and Fmoc- 
protected amino acids using a previously reported protocol [20]. 
The crude peptides were precipitated and washed with cold methyl- 
tert-butyl ether, and purified using reverse phase High Performance 
Liquid Chromatography. The identities of the peptides were con- 
firmed using MALDI-TOF mass spectrometer and the purity was 
evaluated on analytical HPLC instrument. Pure (>90%) lyophilized 
peptides were stored at —20 °C (see Note 5). 


1. 


2. 


Calculate the extinction coefficient of peptide at 214 nm 
(see Note 6). 


Prepare a 1 mM stock solution of peptide in 10 mM HCL. 
Using the Beer- Lambert law, calculate the concentration of the 
peptide that will give an absorbance at 214 nm between 0.2 
and 1. Dissolve a small amount of solid peptide in 1 mL of 
10 mM HCl, dilute the stock of peptide using 10 mM HCl 
(measure at least 10 pL of peptide, perform serial dilutions for 
peptides with high ¢);4), and measure the absorbance. If the 
concentration is above 1 mM, dilute the stock using 10 mM 
HCl. If the concentration is less than 1 mM, add more solid 
peptide. Some peptides do not dissolve completely. If this is the 
case, centrifuge the solution at 10000 x gy for 5 min, carefully 
remove the supernatant, and then measure concentration. Once 
the stock is at 1 mM, perform dilutions and retake absorbance 
measurement at least one more time to ensure reproducibility. 


3.2 Kinetic Assay 


1. 


10. 


11. 


12. 
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Determine the path length in the 96-well plate for a volume of 
200 pL (see Note 7). 


. Prepare two working buffers: (1) 25 mM Hepes-KOH, pH 


8—mix 0.5 mL of 1 M Hepes-KOH and 19.5 mL of water in 
a 50 mL Falcon tube; (2) 28.4 mM Hepes-KOH, pH 8—mix 
0.568 mL of 1 M Hepes-KOH and 19.4 mL of water. 


. Dilute 50 mM CuSO, into water to 10 mM and then further 


dilute the solution tenfold to make 1 mM stock of CuSO, in 
water (see Note 8). 


. Prepare 2 mL of 2 mM DMP solution in water: mix 1.825 mL 


of water, 125 pL of isopropanol, 50 pL of 80 mM DMP. 


. Prepare a mixture of peptide and Cu’* in buffer: mix 40 pL of 


1 mM peptide stock in 10 mM HCl and 20 pL of 1 mM Cu** 
in water, then add 440 pL of 28.4 mM Hepes-KOH buffer, 
pH 8 (see Note 9). This will give 500 pL of solution with 
40 uM Cu** and 80 uM peptide. For a blank solution, mix 
40 pL of 10 mM HCl, 20 pl of 1 mM Cu” in water, and 
440 pL of 28.4 mM Hepes-KOH buffer, pH 8. 


. Fill reaction reservoir (trough) with 25 mM Hepes-KOH buf- 


fer, pH 8. Using a 12-channel pipette dispense 100 uL of this 
buffer into each well to be used of the 96-well plate. 


. Add 50 pL of Cu-peptide solution (40 pM Cu** and 80 uM 


peptide) or blank. Each sample should be analyzed at least 
three times (use three wells). 


. Set up an experiment on the plate reader to follow the absor- 


bance at 476 nm, taking measurements every 10 s for 5 min. 


. Fill another trough with 2 mM DMP. Using a 12-channel 


pipette deliver 50 pL of 2 mM DMP into each well. Gently 
pipette up and down four times to mix (see Note 10). 


Place the 96-well plate into the plate reader and start the mea- 
surements (see Note 11). 


Plot the absorbance at 476 nm as a function of time (s) and 
determine the slope of the linear portion of each kinetic trace; 
this will give the change in absorbance per second. To calculate 
initial rate, divide the slope by the extinction coefficient of the 
DMP oxidation product (3,3’,5,5’-tetramethoxy biphenyl- 
4,4’-diol)—14800 M7! cm7! [16, 21]. The following formula 
illustrates how to calculate initial rate in units of pmol/min 
(1.35 is a coefficient for path length correction): 


Rate (uM / min) =(slope’1.35*10° *60) / 14,800 


Compare the initial rates of blank sample (Cu** in buffer) and 
Cu-peptide samples (see Note 12). 


64 Alex Sternisha and Olga Makhlynets 


3.3 Mixing Peptides 
with Different 
Sequences 


4 Notes 


1. 


Prepare 4 mM peptide stocks in 10 mM HCL, as described in 
Subheading 3.1 (see Note 13). 


. Mix the two different peptide stock solutions (4 mM) in differ- 


ent ratios to a final volume of 40 pL. 


. Add 150 pL of 8 M urea to each sample, mix and incubate at 


room temperature for 15 min (see Note 14). 


4. Add 40 pL of 2 mM Cu’ in water. 


. Initiate fibril formation by adding 1.77 mL of 28.4 mM Hepes- 


KOH buffer, pH 8. Final concentrations of peptide and Cu** 
are 80 and 40 pM, respectively (see Note 15). 


. For the blank sample, mix 40 pL of 10 mM HCl, 150 pL of 


8 M urea, 40 pL of 2 mM Cu”, and 1.77 mL of 28.4 mM 
Hepes-KOH buffer, pH 8. 


. Set up kinetic assay as described in Subheading 3.2, steps 


6-10. 


. Keep the solution of 1 M Hepes-KOH at 4 °C, warm up to 


room temperature before use. Tris buffer coordinates to Cu** 
and therefore should not be used in this assay. For screening at 
different pH, we also used MES (pH 6) and TAPS (pH > 8). 


. Atomic absorption standard for Cu’* was initially used for 


screening. However, this solution (1 g/L = 15.73 mM) con- 
tains 3% nitric acid and slightly reduces the final pH of sam- 
ples (pH 7.9 instead of pH 8). In addition, we found that the 
standard (Ricca) had high absorbance at A below 220 nm 
(even when Cu** was diluted to 10 pM) and we avoided using 
it as a source of Cu** when measuring concentration of 
Cu-peptide or when preparing samples for circular dichroism 
analysis. To measure Cu’* concentration in CuSO, stock, 
solution was diluted to 40 1M into 25 mM Hepes-KOH pH 
8 and then Zincon in water (2 mM stock) was added to a final 
concentration of 50 pM. We measured the extinction coeffi- 
cient for Cu-Zincon complex to be &6.9 = 18145 M7! cm™! 
and dissociation constant Ky = 1.7 pM. The complex of 
Cu-Zincon has a maximum absorbance at 600 nm; however, 
€620 Was used to minimize contribution from unbound Zincon 
(Amax = 470 nm). 


. First, dilute concentrated HCl (12 M, ACS grade) to make a 


1 M HC! solution. 


. DMP solution (80 mM) in isopropanol is stable for at least 1 


week at room temperature. 
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5. Peptides that fibrillate quickly need to be lyophilized as soon as 
HPLC fractions are collected and samples are prepared for 
subsequent analytical HPLC and MALDI-TOF §$analysis. 
Lyophilized peptides can be stored for up to a year at —20 °C 
with containers tightly sealed. Strongly aggregating peptides 
tend to become less soluble over time and their catalytic activ- 
ity decreases. 


6. Calculate extinction coefficients (M7! cm!) of peptides at 
214 nm using the following equation [22], where €peptide bond 1S 
923 Mt cm-land amino acia(®) are Contributions from side chains 
(Table 1). For example, for a 7-residue peptide with an acyl cap 
(Ac) on the N-terminus and an amide on the C-terminus, the 
number of peptide bonds is 7 to account for the contribution 
of an amide cap. 


20 


= * * 
© nepride © peptide _bond N1 peptide _bond + pa min o_acid(i) N, min o_acid(i) 
i=l 


7. Prepare colored solution and measure UV-vis spectrum in a 
cuvette with 1 cm path length. Pipette 200 pL of the same 
solution and measure absorbance at the wavelength of choice 
using plate reader. Calculate correction coefficient using the 
following equation: 


Absorbance measured by UV — vis(1 cm) 


Correction coefficient = 
Absorbance measured by plate reader 


Table 1 
Extinction coefficients of amino acids at 214 nm (M-' cm-") 


Amino acid €44,M-'cm Amino acid €or, M1 om 
Alanine (A) 32 Leucine (L) 45 
Arginine (R) 102 Lysine (K) All 
Asparagine (N) 136 Methionine (M) 980 
Aspartic acid (D) 58 Phenylalanine (F) 5200 
Cysteine (C) 225 Proline (P) 2675 
Glutamine (Q) 142 Serine (S) 34 
Glutamic acid (E) 78 Threonine (T) 4] 
Glycine (G) 21 Tryptophan (W) 29050 
Histidine (H) 5125 Tyrosine (Y) 5375 


Isoleucine (1) 45 Valine (V) 43 
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10. 


ll. 


12. 


13. 


14. 


15. 


Different plate readers could have different correction 
coefficients even when the same plate and the same volume 
are used. 


. 50 mM solution of CuSO, in water is stable for at least a 


year; the 1 mM stock was prepared immediately before kinetic 
experiment. 


. It is important to mix the peptide and Cu** at a low pH, before 


fibrils are formed. First, mix peptide (pH 2 stock) and Cu’* solu- 
tion in water, then add Hepes-KOH buffer, pH 8. Using 28.4 mM 
Hepes-KOH will give a 25 mM final concentration of buffer in 
the sample. The change in pH from 2 to 8, and the presence of 
Cu”, triggers fibril formation. Always make Cu-peptide samples 
in pH 8 buffer immediately before the experiment; we observed 
reduced catalytic activity over time due to fibrils binding to plas- 
ticware. Some peptides need prolonged incubation time to form 
fibrils. Optimal incubation time should be established for each 
peptide family. The easiest approach would be to screen for 
peptide activity using freshly prepared Cu-peptide samples, then 
incubate leftover sample for 24 h and repeat the screen. 

Avoid formation of bubbles as they will interfere with absor- 
bance measurements. 


Final concentrations of the reagents in the well at the onset of 
the reaction are: 10 pM Cu’, 20 uM peptide, 500 pM DMP, 
2.2% isopropanol, 18.8 mM Hepes-KOH, 0.26 mM dioxygen 
(the solubility of dioxygen in air-saturated water at 25 °C and 
760 Torr [23]). With DMP substrate we observed that after a 
minute, the kinetic trace at 476 nm starts to curve; therefore 
losing a few data points in the beginning of the reaction can 
have a significant effect on the calculated rate. If peptides are 
very active, it is recommended to reduce the concentration of 
DMP to 200 uM (final concentration) or lower. 


Conditions described above use stoichiometric amount of Cu?* 
needed to saturate all binding sites in fibrils. Dividing the rate 
calculated for Cu-peptide sample by the rate of the blank sam- 
ple will show rate enhancement of the reaction in the presence 
of peptide. 

Some peptides might not be soluble enough in 10 mM HCl to 
make 4 mM solution. In this case, peptides can be weighed out 
and then dissolved in 8 M urea to make 4 mM solution without 
measuring absorbance at 214 nm (urea absorbs at 214 nm). 


Urea breaks fibrils and this step ensures that the two peptides 
mix properly. Without urea treatment, peptides in 10 mM HCl 
might exist as protofibrils and would not mix well enough to 
result in uniform fibril composition. 


To initiate fibril formation, urea needs to be diluted at least 
10-fold. 
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Fluorescent and Bioluminescent Sensors 


Chapter 5 


Ancestral Protein Reconstruction and Circular Permutation 
for Improving the Stability and Dynamic Range 
of FRET Sensors 


Ben E. Clifton*, Jason H. Whitfield*, Inmaculada Sanchez-Romero, 
Michel K. Herde, Christian Henneberger, Harald Janovjak, 
and Colin J. Jackson 


Abstract 


Small molecule biosensors based on Forster resonance energy transfer (FRET) enable small molecule sig- 
naling to be monitored with high spatial and temporal resolution in complex cellular environments. FRET 
sensors can be constructed by fusing a pair of fluorescent proteins to a suitable recognition domain, such 
as a member of the solute-binding protein (SBP) superfamily. However, naturally occurring SBPs may be 
unsuitable for incorporation into FRET sensors due to their low thermostability, which may preclude 
imaging under physiological conditions, or because the positions of their N- and C-termini may be subop- 
timal for fusion of fluorescent proteins, which may limit the dynamic range of the resulting sensors. Here, 
we show how these problems can be overcome using ancestral protein reconstruction and circular permu- 
tation. Ancestral protein reconstruction, used as a protein engineering strategy, leverages phylogenetic 
information to improve the thermostability of proteins, while circular permutation enables the termini of 
an SBP to be repositioned to maximize the dynamic range of the resulting FRET sensor. We also provide 
a protocol for cloning the engineered SBPs into FRET sensor constructs using Golden Gate assembly and 
discuss considerations for in situ characterization of the FRET sensors. 


Key words Ancestral protein reconstruction, Phylogenetic analysis, Protein engineering, 
Thermostability, Circular permutation, Forster resonance energy transfer, Fluorescence, Biosensor 


1. Introduction 


Optical biosensors, including those that rely on Férster resonance 
energy transfer (FRET sensors), allow robust, noninvasive quanti- 
fication of small molecule dynamics in biological systems [1]. 
FRET is a physical phenomenon whereby excitation of a donor 
fluorophore results in nonradiative energy transfer to an acceptor 
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fluorophore in a distance-dependent manner, which results in 
fluorescence of the acceptor fluorophore. FRET sensors can be 
constructed by fusing a specific pair of fluorescent proteins to the 
termini of a suitable recognition domain, which undergoes a con- 
formational change when it binds the small molecule of interest. 
This ligand-dependent conformational change alters the distance 
between the fluorescent proteins, causing an observable change in 
FRET efficiency, which can be observed as a change in the ratio of 
the fluorescence intensities of the donor and acceptor fluoro- 
phores. The dynamic range of the sensor is the maximum change 
in FRET efficiency between the unbound and ligand-bound states. 
In addition to having a large dynamic range, an ideal sensor would 
be specific for the target ligand and responsive over the physiologi- 
cal concentration range of the ligand; importantly, the sensor must 
also be stable for extended periods under the experimental condi- 
tions (e.g., temperature and pH) required for the biological system 
of interest. These properties of the sensor are largely determined 
by the choice of recognition domain. 

The solute-binding protein (SBP) superfamily is one set of rec- 
ognition domains that is commonly used in FRET sensors for small 
molecules [2—5 |. SBPs exhibit high affinity and specificity toward a 
diverse array of ligands, and they undergo a large conformational 
change upon ligand binding, which can be transduced into an 
optical signal in a FRET sensor construct. However, the low ther- 
mostability of existing SBPs can limit the utility of the resulting 
sensors in biological environments, especially when destabilizing 
modifications to the recognition domain, such as circular permuta- 
tion or specificity-switching mutations, are necessary to improve 
the dynamic range or specificity of the sensor [6]. 

Ancestral protein reconstruction (APR) is one method that has 
been shown to produce consistently thermostable proteins, with 
denaturation temperatures up to 40 °C greater than comparable 
mesophilic proteins [7-11]. We have previously used this method 
to reconstruct a thermostable SBP as a recognition domain for a 
robust FRET sensor for L-arginine [6]. The main steps involved in 
APR are: (1) collection and alignment of a sequence dataset repre- 
sentative of a protein family; (2) inference of a phylogenetic tree 
describing the evolutionary relationships between the protein 
sequences; (3) probabilistic reconstruction of the ancestral protein 
sequences; (4) synthesis and cloning of genes encoding the ances- 
tral proteins; and (5) expression, purification, and biophysical anal- 
ysis of the ancestral proteins. 

It has been argued that the high thermostability of reconstructed 
ancestral proteins reflects the environment of the ancient, thermo- 
philic organisms from which they originate [7, 8, 12]; thus, recon- 
struction of the ancestor of a sufficiently ancient protein family (for 
instance, one that predates the divergence of the major bacterial 
kingdoms) is a viable method for engineering a highly thermostable 


2 Materials 


2.1 Software 
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protein [13]. Notably, computational evolutionary simulations [14] 
and the observation that ancestral proteins share similarities with 
consensus proteins in terms of both sequence and thermostability 
[8, 15, 16] suggest that the high thermostability of ancient proteins 
might also originate from bias in the maximum-likelihood method 
commonly used for APR, specifically, that more probable (often sta- 
bilizing) residues are chosen at every site, while less probable (often 
destabilizing) residues are neglected [14]. The implication of this 
potential bias for protein engineering is that APR may produce ther- 
mostable proteins regardless of their hypothetical age. 

One problem that may be encountered in the construction of 
SBP-based FRET sensors is that the conformational change of the 
SBP may not translate into a change in distance between the N- and 
C-termini, which is often the case when the termini are located on 
the same domain of the bilobal SBP. A small change in the distance 
between the termini of the SBP results in a small change in FRET 
efficiency between the bound and unbound states of the sensor, 
which limits its dynamic range. In this circumstance, the dynamic 
range of the sensor can be improved by circular permutation, a 
method that allows the termini of a protein to be relocated [2, 6]. 
Circular permutation is achieved by rearranging the sequence of the 
SBP such that the original termini are connected by a flexible linker 
and new termini are created by disconnecting the sequence at a dif- 
ferent location, which is chosen to maximize the distance change 
between the termini upon ligand binding. Since circular permuta- 
tion is often destabilizing [17], the use of APR to obtain a thermo- 
stable SBP and the use of circular permutation to improve the 
dynamic range of the resulting FRET sensor are complementary. 

Here, we show how APR can be used to create thermostable 
SBPs for the construction of robust FRET sensors. In this exam- 
ple, the maximum-likelihood statistical framework is used for 
phylogenetic analysis and reconstruction of ancestral sequences, 
although a variety of alternative methods are also available 
[18-20]. We focus on the reconstruction of ancestral SBPs, but 
similar approaches could be used to improve the thermostability of 
any protein, provided that a suitable sequence dataset can be 
obtained. We also show how the dynamic range of the sensor derived 
from an ancestral SBP can be improved by circular permutation 
prior to the insertion of the SBP into the FRET sensor construct. 


1. SeaView version 4.6 (http://doua.prabi.fr/software /seaview). 
2. ProtTest version 3.4 (https: //github.com /ddarriba/prottest3). 


3. FigIree version 1.4.2 (http://tree.bio.ed.ac.uk/software/ 
figtree). 
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4. 


2.2 Online Servers 1. 


non PF WwW WN 


PHYLIP version 3.695 (http://evolution.genetics.washing- 
ton.edu/phylip.html). 


. PAML version 4.9 (http://abacus.gene.ucl.ac.uk/software/ 


paml.html). 


. PyMOL version 1.8 (https: //www.pymol.org). 


NCBI BLAST and PSI-BLAST (http://blast.ncbi.nlm-.nih. 
gov/Blast.cgi). 


. UniProt-KB BLAST (http://www.uniprot.org/blast). 

. MUSCLE (http://www.ebi.ac.uk/Tools/msa/muscle). 
. PhyML (http://www.atgc-montpellier.fr/phyml). 

. Phyre2 (http: //www.sbg.bio.ic.ac.uk/phyre2). 

. CPred (http://sarst.life.nthu.edu.tw/CPred). 


2.3 Cloning Prepare all solutions using ultrapure water. 


1. 


N QD oFr fb 


17. 
18. 


Synthetic genes encoding the circularly permuted ancestral 
proteins (see Subheading 3.7 for design). 


. Primers to amplify the circularly permuted genes with flanking 


Sap! sites for Golden Gate assembly (see Subheading 3.7 for 
design). 


. High-fidelity DNA polymerase (e.g., Phusion Hot Start I 


Polymerase, Thermo Scientific). 


. dNTPs. 

. Thin-walled PCR tubes. 

. PCR thermocycler. 

. PCR purification kit (e.g., Wizard® SV Gel and Clean-Up 


System from Promega). 


. pDOTS4 or pDOTS10 plasmid [6] (see Note 1). 
. Sapl (10 U/L) (e.g., from New England Biolabs). 
. T4 DNA ligase (400 U/pL) with 10x T4 DNA ligase buffer 


(e.g., from New England Biolabs). 


. Electrocompetent £. cols TOP10 cells (Invitrogen). 

. Electroporation cuvettes. 

. Electroporator. 

. YenB media: 7.5 g/L yeast extract, 8 g/L nutrient broth. 
. 5 mL culture tubes. 


. Luria-Bertani (LB) agar plates containing ampicillin (10 g/L 


tryptone, 5 g/L yeast extract, 10 g/L NaCl, 15 g/L agar, 
100 mg/L ampicillin). 
Materials for colony PCR and analysis by gel electrophoresis. 


PCR master mix with loading dye for gel electrophoresis (e.g., 
2x PCR Super Master Mix from Biotool). 


3 Methods 


3.1 Sequence 
Collection 


3.2 Multiple 
Sequence Alignment 


Improving FRET Sensors by Ancestral Gene Resurrection 75 


19. Primers for sequencing (see Subheading 3.8). 


20. 1% agarose gel made with SB buffer (46 g/L boric acid, 8 g/L 
sodium hydroxide), stained with, e.g., GelRed™ (Biotium). 


21. DNA ladder (e.g., GeneRuler 1 kb DNA ladder, Thermo 
Scientific). 


22. Agarose gel electrophoresis apparatus. 


Given a reference SBP with the desired binding specificity, recon- 
struction of an ancestral SBP with improved thermostability and 
the same binding specificity requires a dataset of protein sequences 
that are homologous to the reference SBP (see Note 2). These 
sequences can be identified using a BLAST (Basic Local Alignment 
Search Tool) search of protein sequence databases, such as the 
NCBI database of nonredundant protein sequences or the 
UniProt-KB database. The protein sequences should be selected to 
maximize sequence diversity and phylogenetic diversity, but should 
have the same binding specificity as the reference SBP (see Notes 3 
and 4). The sequences should be evenly distributed throughout 
sequence space; in other words, clusters of sequences with very 
high identity (>90%) and outlier sequences with low identity to 
any other sequence in the dataset should be avoided. A small set of 
outgroup sequences should also be selected for rooting the phylo- 
genetic tree (see Note 5). In the case of SBPs, a suitable outgroup 
would be an SBP homologous to the reference SBP, but with a 
different binding specificity; for example, in our recent work, 
anionic amino acid-binding proteins served as an outgroup for cat- 
ionic and neutral amino acid-binding proteins [6]. 

As a guideline, between 50 and 250 sequences should be 
selected for the phylogenetic analysis. If too many sequences are 
included, the phylogenetic analysis will be computationally inten- 
sive, while if too few sequences are included, there may be insuffi- 
cient data to reconstruct the ancestral sequences accurately, or the 
sequence diversity within the dataset may be insufficient to make 
the ancestral proteins more thermostable than the reference SBP. 


Before the sequences can be used for phylogenetic analysis, they 
must be collated in a multiple sequence alignment. In the phyloge- 
netic analysis and subsequent reconstruction of ancestral sequences, 
it is assumed that each column in the multiple sequence alignment 
corresponds to a set of residues that originated from a common 
ancestor. If the alignment is incorrect, this assumption is violated; 
hence, the quality of the multiple sequence alignment is a critical 
factor in the success of the project. The number of gaps should be 
minimized by careful editing of the alignment to remove poorly 
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3.3 Model Selection 


conserved N- and C-terminal regions and insertions that are unique 
to individual protein sequences, since PAML (the program used for 
reconstruction of ancestral protein sequences) will attempt to recon- 
struct one ancestral residue for each column in the alignment. 


1. Align the protein sequences from Subheading 3.1 using the 
MUSCLE server [21] (see Note 6). 


2. Open the alignment in SeaView [22]. 


3. Inspect the alignment carefully. If necessary, make manual 
adjustments to improve the alignment, particularly in regions 
with many gaps. 


4, Trim the alignment, removing insertions that would not be 
present in the ancestral proteins of interest (see Note 7). 
Remove poorly conserved N- and C- terminal extensions, 
including the N-terminal signal peptides in the case of SBPs. 


5. Save the alignment in PHYLIP interleaved format. 


Phylogenetic analysis and reconstruction of ancestral protein 
sequences using the maximum-likelihood method requires a prob- 
abilistic model of protein evolution, which is needed to evaluate 
the likelihood of different evolutionary scenarios. The main com- 
ponent of this evolutionary model is an empirical substitution 
matrix that specifies the frequency of each amino acid and the sub- 
stitution rate between each pair of amino acids. Some modifica- 
tions are also available to make the basic evolutionary model more 
realistic; for example, different sites in the protein are often allowed 
to evolve at different rates by modeling rate heterogeneity using 
the discrete-gamma model (denoted +G) [23]. 

The first step in maximum-likelihood phylogenetic analysis is 
to choose the evolutionary model most appropriate for the given 
sequence dataset, which can be achieved using programs such as 
ProtTest [24] (see Note 8). ProtTest evaluates different models 
using the Akaike information criterion (AIC), a metric that 
accounts for the likelihood of the tree inferred using the model 
(the goodness of fit) and the number of model parameters (to 
penalize overfitting). 


1. Open the ProtTest graphical interface and load the alignment 
file. 


2. Select “Compute likelihood scores” under the Analysis menu. 


3. Select the substitution matrices to evaluate. At a minimum, 
select the general-purpose JTT, WAG, and LG matrices; most 
of the remaining matrices are based on specific types of pro- 
teins (for example, MtArt is based on sequence data from 
invertebrate mitochondrial proteins). 


4. Ensure that the +1, +G, +1+G, and empirical amino acid fre- 
quency options are selected to evaluate these modifications. 


3.4 Phylogenetic 
Tree Inference 
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5. Select “BIONJ tree” as the base tree for likelihood calcula- 
tions (see Note 9). 


6. After the computation is complete, select “Results” under the 
Selection menu. Identify the model with the lowest AIC, 
which will be used for the subsequent analysis. 


Given an alignment of protein sequences and a probabilistic model 
of protein evolution, a phylogenetic tree can be inferred using the 
maximum-likelihood method. The goal of phylogenetic inference 
using ML is to identify the tree topology (branching pattern) and 
parameters (for example, branch lengths) that optimize the likeli- 
hood function. Likelihood is defined in this context as the proba- 
bility of observing the data—that is, the probability that the 
present-day sequences would have evolved—given the tree topol- 
ogy, tree parameters, and an evolutionary model. In practice, the 
maximum-likelihood tree is inferred by first estimating the correct 
tree using a computationally simple method such as the BIONJ 
method; heuristic tree-searching algorithms are then applied to 
modify the topology of the tree and increase its likelihood until 
convergence on the maximum-likelihood tree is achieved. 

Inference of a phylogenetic tree for the purpose of APR should 
be considered an iterative process; problematic sequences should 
be identified and removed, and alternative sequences should be 
added before repeating the phylogenetic analysis. The final tree 
should be biologically plausible, supported by high bootstrap val- 
ues, and robust to variations in the evolutionary model used. 


1. Open the PhyML [25] web interface and upload the sequence 
alignment. 


2. Select “Amino-Acids” for data type. 


3. Under “Substitution Model,” specify the substitution model 
chosen in Subheading 3.3. +I denotes an estimated propor- 
tion of invariant sites; +G denotes an estimated gamma-shape 
parameter, and +F denotes empirical amino acid frequencies. 


4. Under “Tree Searching,” select “BIONJ” for the starting tree and 
“NNI + SPR” for the type of tree improvement (see Note 10). 


5. Under “Branch Support,” select the “aLRT SH-like” branch 
support test. Alternatively, for the final tree, select a bootstrap 
calculation with 100 replicates. 


6. Once the analysis has completed, open the maximum-likeli- 
hood tree in FigTree. The tree can be rooted by selecting the 
branch connecting the ingroup and outgroup sequences and 
clicking the “Reroot” button. 


7. Inspect the tree to identify problems such as long branches 
and low branch support values, and rectify these problems by 
repeating the multiple sequence alignment and phylogenetic 
analysis using a modified sequence dataset if necessary 
(see Notes 11 and 12). 
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8. Assess the robustness of the ML tree to variations in the sub- 
stitution model by repeating the tree inference in PhyML 
using alternative substitution matrices that scored highly in 
Subheading 3.3. The resulting trees should be very similar to 
the maximum-likelihood tree. 


3.5 Reconstruction 1. Ensure that the alignment file and tree file are properly for- 
of Ancestral matted for input into PAML [26] (see Note 13). 
Sequences 2. Set up the program by modifying the control file (“codeml.ctl”), 


as shown in Table 1. 


3. Run the program from the command line using the command 


codeml. 
Table 1 
Variables in the control file for ancestral protein reconstruction 
using PAML 
seqfile Filename of alignment 
treefile Filename of tree 
outfile Filename of output 
noisy 9 
verbose 2 
runmode 0 
seqtype 2 
clock 0 
aaRatefile Filename of rate matrix (e.g., lg.dat for the LG matrix) 
model 2 
Mgene 0 
fix_alpha 0 to use +G model, else 1 
alpha 0.5 to use +G model, else 0 
Malpha 0 
neatG Number of rate categories to use for the +G model 
getSE 0 
RateAncestor 1 
Small_Diff 5e-6 
cleandata 0 
method 0 


Deviations from the settings in the default control file are indicated in bold. The 
remaining variables in the control file are not applicable to amino acid-based analysis 
and can be removed 


3.6 Characterization 
of Ancestral Proteins 


3.7 Design 
of Circularly 
Permuted SBPs 
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4. Open the result file (“rst”) and scroll down to the text “tree 
with node labels.” Copy the tree into a new file and open 
the file in FigTree. Identify the ancestral nodes of interest 
(see Note 14) and record the node labels that identify them. 


5. Search the result file for the text “node #x” (where x is the 
number identifying the ancestral node) to find the maximum- 
likelihood ancestral sequence associated with that ancestral 
node. 


6. Open the alignment file in SeaView and add the ancestral pro- 
tein sequences to the alignment. Edit the ancestral sequences 
to remove any remaining insertions that are artifacts of the 
reconstruction process (see Note 7). 


Once the ancestral protein sequences have been obtained, they are 
back-translated into nucleotide sequences and codon-optimized for 
expression in Escherichia coli, and the genes are synthesized. The 
synthetic genes are cloned into expression vectors, and the ancestral 
proteins are expressed in E. coli and purified. The thermostability of 
the ancestral proteins can be assessed using methods such as circular 
dichroism spectroscopy, differential scanning fluorimetry (DSF), or 
differential scanning calorimetry. The binding specificity of the 
ancestral proteins can be assessed using methods such as isothermal 
titration calorimetry (ITC), DSF, or (in some cases) fluorescence 
spectroscopy. If the ancestral SBPs have high thermostability and 
the desired binding specificity, they are strong candidates for circu- 
lar permutation and incorporation into FRET sensor constructs. 


The circular permutation of SBPs has been described by Okada 
et al. [2]. It is critical to identify sites for the new N-and C- termini 
that will produce a sensor with high dynamic range without nega- 
tively affecting the binding affinity or stability of the protein. 


1. Using a crystal structure or homology model of the ancestral 
SBP (created using the Phyre2 server [27], for example), select 
the positions of the new N- and C-termini. The new termini 
must be located on different lobes of the SBP to maximize the 
change in their relative positions due to the ligand-induced 
conformational change. This can be achieved by deleting a sec- 
tion of the protein that links the two lobes (i.e., a hinge strand) 
(see Note 15). The result is a theoretical protein with the origi- 
nal N- and C-termini, and additional N*- and C*-termini, i.e., 
two protein fragments. 


2. Design a linker sequence to fuse the original N- and C-termini 
of the SBP. Each repeat of a flexible (GGS)z linker is approxi- 
mately 11.4 A in length. Measure the distance between the 
N- and C-termini of the SBP; the linker should have enough 
(GGS)n repeats to bridge this distance. 
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3.8 Cloning 
into a FRET Sensor 


Construct 


3.9 Characterization 
of the FRET Sensors 
In Vitro and In Situ 


3. 


Theoretically construct the circularly permuted SBP sequence 
by concatenating the three protein fragments: the N*-terminus 
to C-terminus fragment, the flexible linker, and the N-terminus 
to C*-terminus fragment. 


. Back-translate the circularly permuted SBP sequence to obtain 


a nucleotide sequence codon-optimized for expression in 
E. coli, and synthesize the gene (see Note 16). 


. Design primers to amplify the gene with flanking SalI sites for 


insertion into pDOTS4 or pDOTS10. The forward primer 
contains the sequence 5’-GCTCTTCAATC-3’ followed by the 
first 10-15 nucleotides of the circularly permuted gene. The 
reverse primer contains the sequence 5’-GCTCTTCCGAG-3’ 
followed by the first 10-15 nucleotides of the reverse comple- 
ment of the circularly permuted gene (see Note 17). The 
length of the primers should be chosen such that the melting 
temperatures of the primers are approximately 65 °C. 


Carry out all procedures at room temperature unless otherwise 
specified. 


1. 


Using standard PCR with a high-fidelity DNA polymerase 
(e.g., Phusion Hot Start II), amplify the SBP gene using the 
cloning primers from Subheading 3.7. Purify the PCR prod- 
uct using a PCR purification kit, according to the manufac- 
turer’s instructions. 


. Ina thin-walled PCR tube, chilled on ice, mix 2 pL of 10x T4 


DNA Ligase buffer, 1 pL 400 U/pL T4 DNA ligase, 1 pL 
10 U/L Sapl, 100 ng pDOTS4 or pDOTS10, a fivefold 
molar excess of the purified PCR product from the previous 
step, and water to give a final volume of 20 pL. 


. Perform the Golden Gate assembly reaction using the follow- 


ing thermocycling protocol: 33 cycles of (37 °C for 2 min, 
then 16 °C for 3 min); 55 °C for 10 min; 80 °C for 5 min; 
then hold at 4 °C. 


. Transform one aliquot of E. coli TOP10 cells with 2.5 pL of 


the Golden Gate assembly reaction mixture by electroporation 
(see Note 18). After electroporation, resuspend the cells in 
1 mL YenB and incubate with shaking at 37 °C for 1 h. Plate 
100 pL of the culture on an LB/ampicillin agar plate and 
incubate at 37 °C overnight (see Note 19). 


. Use colony PCR and Sanger sequencing of the resulting PCR 


products to confirm correct insertion of the gene into the vec- 
tor (see Note 20). 


After expression of the FRET sensor in E. cold and subsequent puri- 
fication (see Notes 21 and 22), the dynamic range of the sensor 
and the affinity of the sensor for its ligand should be measured 


4 Notes 
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using fluorescence titrations. ITC can also be used to accurately 
measure the dissociation constant (Ky) of the sensor and the pro- 
portion of the sensor that is active. 

The approach for in situ characterization of the FRET sensor 
will depend on where the sensor will be used to measure the rest- 
ing level of its ligand, its dynamic changes or distribution. For 
instance, this could be intra- or extracellularly, in cell culture sys- 
tems, or more complex tissue such as acute slices or in vivo. While 
intracellular ligand measurements typically require the sensor to be 
expressed by the cell of interest using an appropriate vector, extra- 
cellular sensor localization can be achieved by expression using 
plasma membrane targeting sequences and anchoring motifs [4]. 
However, sensor expressed endogenously and targeted to the 
membrane can be exposed to intracellular ligands during transit to 
the membrane. Both sensor populations, surface-presented and 
intracellular, will generate fluorescence that cannot be easily sepa- 
rated using diffraction-limited microscopy techniques, thus poten- 
tially limiting the interpretation of data. An alternative approach to 
target FRET exclusively to extracellular space relies on a biotin- 
streptavidin anchoring strategy [6, 28]. In our previous work, a 
biotin tag was incorporated into the FRET sensor using the 
pDOTS10 plasmid. Extracellular immobilization of the sensor was 
then achieved by linking the sensor to surface proteins, biotinyl- 
ated using commercially available streptavidins and NHS-ester 
activated biotinylation reagents, via streptavidin [6]. 

Classically, FRET sensor imaging is performed using single pho- 
ton excitation and quantification of donor/acceptor fluorescence 
intensity, donor lifetime, or emission spectra [4]. In thicker prepara- 
tions like acute tissue slices and in vivo, two-photon excitation using 
near-infrared pulsed lasers has superior performance and provides 
good optical access to deeper structures [29] at the potential cost of 
reduced spectral separation of FRET donor/acceptor excitation 
(e.g., ECFP/EYFP) [30]. Using pulsed lasers, typically in the MHz 
frequency range (e.g., Ti:sapphire lasers around 80 MHz), also 
enables fluorescence lifetime imaging using time-correlated single 
photon counting (TCSPC-FLIM) of FRET donor fluorescence, a 
powerful tool to study binding of a ligand to a FRET sensor. Both 
modes of excitation allow monitoring of sensor fluorescence at high 
spatial and temporal, micrometer and millisecond, resolution. 


1. pDOTS4 and pDOTS10 are mother plasmids for the cloning 
of SBPs to generate FRET sensors [6]. The pDOTS4 back- 
bone is based on a pRSET plasmid [4] (pRSET FLIPE-600n, 
Addgene #13537, courtesy of Wolf Frommer) containing a 
6x His-Tag at the N-terminus. For the construction of 
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pDOTS4, the native Sapl site was first removed by site- 
directed mutagenesis. To adapt this plasmid for Golden Gate 
assembly [31], the ybeJ coding sequence in pRSET FLIPE- 
600n was removed and replaced with Sapl- NotI-Sapl restric- 
tion sites. The SBP gene can be introduced into the pDOTS4 
plasmid using Golden Gate assembly (see Subheadings 3.7 
and 3.8); the SBP will be fused to a 6x His-Tag and ECFP at 
the N-terminus and to Venus fluorescent protein at the 
C-terminus. To generate pDOTS10, a biotin tag followed by 
a GSSG linker, synthesized de novo (Epoch Life Science) 
based on the sequence from the PinPoint” Xa-1 plasmid, was 
amplified by PCR and cloned into pDOTS4 between the 6x 
His-Tag and ECFP using the BamHI restriction site. The SBP 
gene can be introduced into the pDOTS10 plasmid using 
Golden Gate assembly (see Subheadings 3.7 and 3.8); the SBP 
will be fused to a 6x His-Tag, biotin tag, GSSG linker, and 
ECFP at the N-terminus, and to Venus fluorescent protein at 
the C-terminus. pDOTS4 and pDOTS10 will be made avail- 
able via Addgene. 


. Usually, protein sequences are used for ancestral reconstruc- 


tion rather than nucleotide sequences because they are more 
highly conserved and therefore retain more phylogenetic sig- 
nal over longer evolutionary distances; protein sequences are 
under stronger selection than nucleotide sequences because 
many nucleotide substitutions are synonymous and have low 
fitness costs. However, nucleotide- or codon-based phyloge- 
netic analyses are also possible. 


. Information about sequence-function relationships from crys- 


tal structures of the reference SBP or other experimental data 
is useful for identifying protein sequences that are likely to 
have the same or similar binding specificity. If the structure of 
the reference SBP is unknown, it may be possible to predict 
binding site residues using the structures of homologous 
SBPs; these residues should be conserved when the binding 
specificity is conserved. 


. Several strategies can be used to increase the diversity of a 


sequence dataset: PSI-BLAST, which performs iterative 
BLAST searches, can be used to find sequences more distantly 
related to the reference SBP; multiple SBP sequences can be 
used as input for the BLAST search (provided that they are 
actually homologous); programs such as CD-HIT [32] can be 
used to filter redundant sequences out of the dataset; BLAST 
searches can be restricted to certain taxonomic groups to 
obtain sequences from phylogenetically diverse organisms. 
Phylogenetic diversity in the sequence dataset is desirable to 
enable reconstruction of more ancient (and more thermosta- 
ble) ancestral proteins. 


10. 
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. The root of a phylogenetic tree is the node corresponding to 


the ancestor of all sequences in the tree. Rooted trees show the 
position of the root and the direction of evolution, whereas the 
unrooted trees produced by the maximum-likelihood method 
contain no information about the direction of evolution. This 
problem can be circumvented by the inclusion of outgroup 
sequences, which are known to be more distantly related to the 
sequences of interest (ingroup sequences), in the phylogenetic 
analysis. The root of the tree will be located somewhere on the 
branch joining the ingroup and outgroup sequences. 


Various alignment programs with different specialities are avail- 
able. In addition to MUSCLE, these include MAFFT [33], 
T-COFFEE [34], and PRANK [35]. In particular, PRANK 
may be useful for APR, since its phylogeny-aware algorithm is 
able to distinguish independent insertion and deletion events 
that would result in erroneous inferences of homology in con- 
ventional multiple sequence alignment programs [36]. 


Programs for automatic curation of multiple sequence align- 
ments are available (such as Gblocks [37 ]), but these programs 
may be too zealous in removing poorly aligned regions of the 
alignment for this particular application. Although insertions 
unique to individual sequences or a small cluster of sequences 
can be safely removed, care should be taken not to delete col- 
umns that might contain residues present in the ancestral pro- 
teins of interest. Since phylogenetic information is needed to 
make this judgement, the alignment should be edited conser- 
vatively before the reconstruction of ancestral sequences. 
Artifactual insertions in the ancestral sequences can be 
removed afterward, once the phylogeny is known. 


. More recent implementations of maximum-likelihood meth- 


ods may give superior performance in terms of computational 
speed and/or accuracy. Alternative software options include 
IQ-TREE [38] for model selection, IQ- TREE or RAxML 
[39] for tree inference, and FastML [40] for reconstruction of 
ancestral sequences. 


. The assumption implicit in this choice is that the BLIONJ tree 


is a good enough approximation to the maximum-likelihood 
tree that the substitution model of best fit will be identical for 
both trees. It is more accurate to use the maximum-likelihood 
tree as the base tree for the likelihood calculations, but the 
computation will be much slower. 


If these settings are used, the tree search will be deterministic, 
that is, the same alignment will always produce the same tree. 
This can be problematic, since the tree search may stall at a 
locally optimal tree and fail to converge on globally optimal 
tree (the true maximum-likelihood tree). It is therefore good 
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11. 


12. 


13. 


14. 


practice to repeat the analysis in PhyML with randomized start- 
ing trees; if the same tree topology is recovered each time, it is 
more likely that the globally optimal tree has been recovered. 


Long branches on a phylogenetic tree arise from the inclusion 
of highly divergent sequences and cause a systematic error 
known as long branch attraction. If possible, long branches 
should be broken up by including additional sequences that 
have higher identity to the divergent sequences; otherwise, 
the divergent sequences may need to be removed. 


Phylogenetic trees inferred using maximum-likelihood can be 
validated in several ways. First, the topology of the tree should 
be robust to variations in the evolutionary models used in the 
calculation; the phylogenetic analysis should be repeated using 
alternative substitution matrices that scored highly in 
Subheading 3.3, and the resulting tree topologies should be 
very similar to the maximum-likelihood tree. Second, statisti- 
cal support for individual branches in the tree should be 
assessed using the bootstrap method, which involves repeating 
the phylogenetic analysis many times (usually at least 100) 
using multiple sequence alignments generated by random 
resampling of the complete alignment, to determine whether 
the same tree topology can be reproduced even when some of 
the data are removed or duplicated. The bootstrap value of a 
branch is the proportion of the bootstrap trees that contain 
the same bifurcation of sequences as denoted by that branch. 
Branches with bootstrap values <70% should be viewed with 
suspicion. Finally, if the tree describes the evolution of orthol- 
ogous sequences, it should be consistent with established 
species-based trees (although this criterion is less applicable to 
SBPs due to the prevalence of horizontal gene transfer). 


In our experience, the file formatting requirements of PAML 
are the most common cause of any errors encountered while 
running the program. The alignment must be in PHYLIP 
interleaved format. This format must be specified by adding 
the letter “I” (for “interleaved”) to the end of the first line 
(after the number of sequences and number of columns). 
There must also be two spaces between each sequence name 
and the corresponding sequence. The tree must be in Newick 
format. PAML can only parse trees obtained from PhyML if 
they do not contain branch support values. Trees with branch 
support values can be reformatted using the program Retree 
in the PHYLIP package; open the tree in Retree and write the 
(unrooted) tree to a new file without further modification. 


Ideally, multiple ancestral nodes between the reference SBP 
and the last common ancestor of the sequences of interest 
would be selected and characterized experimentally. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 
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Previous examples of circularly permuted SBPs [2, 6] can be 
used as a guide to identify potential sites for circular permuta- 
tion, provided that the proteins share sufficient homology. 
Otherwise, select sites in disordered loops of the protein (as 
indicated by high B-factors in the crystal structures of related 
proteins), and avoid the removal of any residues that contribute 
to secondary structure. Multiple sites may need to be tested. 
As a further check, the CPred server [41] can be used to evalu- 
ate the viability of potential sites for circular permutation. 


Ensure that no Sapl sites (GCTCTTC) are present in the syn- 
thetic gene, as these will prevent successful cloning into 
pDOTS4 or pDOTS10. 


Cloning into the pDOTS4 or pDOTS10 plasmids by Golden 
Gate assembly results in addition of an N-terminal isoleucine 
residue and a C-terminal leucine residue to the cloned gene, 
which should be taken into account if the linkers between the 
fluorescent proteins and recognition domain are optimized. 


Competent cells with high transformation efficiency (107— 
108 cfu per pg DNA) are required for this step. If the transfor- 
mation efficiency of the competent cells is low, plate the entire 
volume of electroporated cells by pelleting the cells (18000 xg 
on a benchtop centrifuge for 1 min), discarding the superna- 
tant, resuspending the pellet in 100 pL YenB media, and plat- 
ing the resulting suspension. 


If no colonies are observed, the ligation by T4 DNA ligase 
may have been unsuccessful; if the colonies only contain empty 
vectors, digestion of the template gene by SapI may have been 
unsuccessful. 


We recommend sequencing in two reactions, first using a 
T7-specific primer (forward) in combination with a SBP- 
specific primer (reverse), then using an SBP-specific primer 
(forward) with a T7-specific primer (reverse), because the 
length of the full construct exceeds the length of most Sanger 
sequencing reads. 


High levels of expression can be achieved using pDOTS4 or 
pDOTS10 by incubation of E. colt BL21(DE3) transformants 
in auto-induction media at 20 °C for 72 h. Expression can be 
monitored during growth by removing a 1 mL aliquot from 
the culture, centrifuging the sample to pellet the cells, resus- 
pending the pellet in 1 mL water, and measuring the fluores- 
cence spectrum of the resulting cell suspension. With excitation 
at 433 nm, an emission peak at 476 nm corresponding to 
ECFP and an emission peak at 525 nm corresponding to 
Venus should be apparent. Since maturation of the Venus fluo- 
rophore is slower than maturation of the ECFP fluorophore, 
the Venus peak may not be observed until after 48 h. 
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22. The FRET sensors are expressed with N-terminal hexahisti- 
dine tags, allowing purification by nickel affinity chromatogra- 
phy. In our experience, further purification by size-exclusion 
chromatography can also improve the dynamic range of the 
sensor. Finally, extensive dialysis may be required to remove 
any endogenously bound ligands, as sensor molecules that are 
already ligand-bound do not exhibit a ligand-dependent 
change in FRET, reducing the observed FRET efficiency. 
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Chapter 6 


Method for Developing Optical Sensors Using a Synthetic 
Dye-Fluorescent Protein FRET Pair and Computational 
Modeling and Assessment 


Joshua A. Mitchell, William H. Zhang, Michel K. Herde, 
Christian Henneberger, Harald Janovjak, Megan L. 0’Mara, 
and Colin J. Jackson 


Abstract 


Biosensors that exploit Forster resonance energy transfer (FRET) can be used to visualize biological and 
physiological processes and are capable of providing detailed information in both spatial and temporal 
dimensions. In a FRET-based biosensor, substrate binding is associated with a change in the relative posi- 
tions of two fluorophores, leading to a change in FRET efficiency that may be observed in the fluorescence 
spectrum. As a result, their design requires a ligand-binding protein that exhibits a conformational change 
upon binding. However, not all ligand-binding proteins produce responsive sensors upon conjugation to 
fluorescent proteins or dyes, and identifying the optimum locations for the fluorophores often involves 
labor-intensive iterative design or high-throughput screening. Combining the genetic fusion of a fluores- 
cent protein to the ligand-binding protein with site-specific covalent attachment of a fluorescent dye can 
allow fine control over the positions of the two fluorophores, allowing the construction of very sensitive 
sensors. This relies upon the accurate prediction of the locations of the two fluorophores in bound and 
unbound states. In this chapter, we describe a method for computational identification of dye-attachment 
sites that allows the use of cysteine modification to attach synthetic dyes that can be paired with a fluorescent 
protein for the purposes of creating FRET sensors. 


Key words Synthetic dye, Optical sensor, Computational modeling, Forster resonance energy transfer 


1. Introduction 


Optical sensors have allowed for the investigation of physiological 
processes such as neurotransmission with both spatial and tempo- 
ral resolution. FRET-based optical sensors are particularly useful, 
as they are capable of giving quantitative recordings independent 
of sensor concentration due to the ratiometric signal output of the 
sensor and the concentration independence of FRET donor 
lifetimes [1, 2]. Contemporary sensors typically use fluorescent 
proteins (FPs). However, some have used synthetic fluorescent 
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2 Materials 


dyes rather than FPs as the signaling component of the sensor. As 
there is much greater control over the precise location of synthetic 
fluorophores, synthetic dyes can theoretically allow for even small 
conformational changes to produce a measurable FRET signal. 
While a large conformational change for a given protein is always 
desirable for sensor construction, it is often a necessity when using 
FPs, meaning that synthetic dyes are potentially applicable across a 
much larger range of proteins, rather than being restricted to those 
with large distance-based conformational changes. 

Sensors that use synthetic dyes are either intensity-based sen- 
sors (non-FRET), which require multiple synthetic components, 
or must be developed through extensive high-throughput screen- 
ing [3, 4]. For example, in addition to the use of two synthetic 
dyes, the Snifit-type sensor design requires the development and 
synthesis of a tethered competitive ligand that can occupy the 
binding active site, which necessarily introduces an extra design 
phase of engineering and screening [3]. On the other hand, the 
EOS-type sensor developed by Namiki et al. only requires a single 
synthetic component (a dye) [4], but still requires exhaustive 
screening of different residues to find a location that gives a strong 
signal. In addition, EOS sensors do not produce ratiometric out- 
put and are therefore not quantitative. 

It is possible to improve on one or more of these design com- 
ponents when creating a synthetic dye-based sensor. Specifically, it 
has been shown that it is possible to create FRET sensors that are 
a combination of one FP and one synthetic dye [5], which is an 
improvement over both single fluorophore and two dye sensors as 
it is ratiometric and requires one fewer site-specific modification. 
Additionally, rather than using brute force high-throughput 
screening of all residue locations to identify a dye labeling site, we 
have expedited sensor development through the use of computa- 
tional screening, which can reduce the number of possible dye 
labeling sites to a subset with a higher likelihood of yielding a func- 
tional sensor. We have used these computational techniques in tan- 
dem with synthetic dyes (via thiol-maleimide labeling of residues) 
to develop optical sensors with large dynamic ranges. 


Whenever possible, prepare all stock solutions and buffers in ultra- 
pure water (MilliQ). For reagents that have poor solubility in water, 
dissolve them with the smallest possible proportion of organic sol- 
vent (i.e., 5% DMSO would be preferable to 10% DMSO) as some 
proteins may have poor stability in organic solvent. All solutions 
and reagents should be prepared as fresh as possible as some reagents 
will have short lifetimes when in solution, even at —20 °C. All 
buffer solutions should be filtered through a membrane filter 
(pore size 0.45 pm or smaller) after preparation. 


2.1 Software 


2.2 Cloning 
and Protein 
Purification 
Components 


2.3 Dye Labeling 
Components 
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. Python 2.7.11 (www.python.org). 

. Bash 4.2.46 (www.gnu.org/software/bash/). 

. GROMACS 5.1.2 (www.gromacs.org) [6]. 

. MARTINI 2.1 (http://md.chem.rug.nl/) [7]. 

. PyMol 1.7.6.0 (https://sourceforge.net/projects/pymol). 
. GAWK 4.0.2 (www.gnu.org/software/gawk/). 

. DSSP 2.2.1 (http://swift.cmbi.ru.nl/gv/dssp/). 

. Grep 2.5.1 (https://www.gnu.org/software/grep/). 

. Curl 7.29.0 (https: //curl.haxx.se/). 


. The gene of interest in an expression vector (see Subheading 3.3). 
. Primers containing the mutation of interest (see Subheading 3.3). 
. High-fidelity DNA polymerase kit. 

. Gibson assembly kit. 

. Thin-walled PCR tubes. 


PCR thermocycler. 


. PCR purification kit. 

. Competent cells for cloning (Top10). 

. Competent cells for protein expression (BL21DE3). 

. Materials for colony PCR and analysis by gel electrophoresis: 


(a) PCR master mix. 
(b) T7 primers. 


(c) 1% agarose gel made with SB buffer (46 g/L boric acid, 
8 g/L sodium hydroxide), with a visualizing stain. 


(d) DNA ladder mix. 


(e) Agarose gel electrophoresis apparatus. 


. Buffer solution, Phosphate buffer (see Note 1): 0.05 M Sodium 


phosphate, 0.2 M NaCl. Adjust the pH to what is appropriate 
for both the protein of interest and the chemistry that is needed 
to label the protein with the synthetic dye (using either hydro- 
chloric acid or sodium hydroxide). 


. TCEP stock solution: dissolve 0.1437 g of Tris(2-carboxyethyl) 


phosphine hydrochloride (TCEP-HCl) in 1 mL of buffer 
solution (501 mM stock solution) (see Note 2). 


. Dye stock solution: Add a suitable solvent to the synthetic dye 


to achieve a stock solution with a final concentration of 
10 mM. In the case of the Alexa Fluor 532 C5 Maleimide, 
1 mg was dissolved in 123 pL of phosphate buffer (10 mM 
stock solution) (see Note 3). 
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3 Methods 


3.1. Computational 
Screening and Residue 
Selection 


4. Protein solutions: proteins should be concentrated as much as 
practical (ideally at least 500 sM) and exchanged or dialyzed 
into the same buffer solution as used to prepare the reagent 
stock solutions (see Note 4). 


All experimental (noncomputational) work should be performed 
at 4 °C unless otherwise specified. 


First, prepare models of the target fusion protein in both its bound 
and unbound conformations. This involves modeling both the 
solute-binding and fluorescent domains with the appropriate 
linker. Start with crystal structures or high-quality homology mod- 
els of the desired binding core in both conformations and the fluo- 
rescent protein. Ensure no nonstandard residues exist in the 
models, as these are not parameterized in the MARTINI forcefield 
(see Note 5). Reconstruct any residues missing from the crystal 
structures at the termini by which they are fused. Then, construct 
the linker sequence as expressed experimentally (see Note 6). 
Complete the model by fusing the two domains (see Note 7). Save 
both SBP-linker-FP models as .PDB files. 

Create a directory for each conformation, copy the appropriate 
.PDB file into each, and also copy simulations.sh into each (see 
Note 8). The script simulations.sh automatically prepares and runs 
a number of MARTINI simulations. It depends on all of the soft- 
ware in Subheading 2.1 except MARTINI, which is downloaded 
automatically by the script, provided all software dependencies are 
available in your $PATH (see Note 9). The script can be config- 
ured by making a copy of it in each conformation directory and 
editing the leading “CONFIG” portion. Most defaults should be 
appropriate; however, the input_file, system_name and linker_resi- 
dues variables should be set for your system (see Note 10). 
Configure and run the script, read and follow the directions given, 
then repeat for the other conformation. 

Run the second script “process-data.py,” giving as the first two 
arguments the locations of each set of output files. The residue 
number of the central residue of the FP fluorophore and the range 
of residues that should be checked for sensor generation should 
also be given as arguments, respectively, with the -f and —r switches 
(see Note 11). process-data.py reads the given .PDB files, and pre- 
dicts dynamic ranges for sensors formed by labeling each residue in 
the given range with a dye with configurable Forster distance. This 
output is stored by default as comma-separated values with appro- 
priate headers in sensor_predict.csv and can be visualized graphi- 
cally using a program such as Excel. 


3.2 Cloning 
and Purification 
of the Mutant Proteins 
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Fig. 1 A graphical representation of the script output. The data shown in this 
instance is hypothetical and is not based on a true set of calculations. The script 
determines the dynamic range for a hypothetical sensor that has been labeled 
with a fluorescent dye at a given residue. For example, for a construct with an 
ECFP fused at the N-terminus of the binding protein, a sensor construct that has 
been labeled at residue 50 is predicted to have a very small or nonexistent 
dynamic range. Conversely, for a sensor that has been labeled at residue 200, 
the dynamic range should be large, as it is approaching the theoretical maximum 
dynamic range possible for the construct 


Select residues that are appropriate for cysteine mutagenesis 
(or mutagenesis to an appropriate residue). Note that this predic- 
tion does not account for any disruption of binding core function 
associated with chemical labeling. Therefore, a residue that yields 
the largest predicted dynamic range may not necessarily yield the 
best sensor, as the residue may have some structural or functional 
importance, which may be disrupted with mutagenesis. Residues 
with side chains oriented toward the solvent, or that are not a part 
of a structural motif should be selected preferentially. In the hypo- 
thetical data set example (Fig. 1), residue 200 is predicted to yield 
a large dynamic range upon labeling with a dye. Suppose, however, 
that for this hypothetical protein residue 200 is both not exposed 
to solvent and has its sidechain oriented toward the binding site of 
the protein (Fig. 2). Mutating and labeling this residue may abolish 
protein function, as the sterically bulky dye excludes the substrate 
from the active site. In contrast, although residue 100 has a smaller 
predicted dynamic range than residue 200, it is located on a flexible 
loop that does not have significant contact with other structural 
features of the binding protein. This makes this location preferable 
to residue 200, as labeling the residue is less likely to impact the 
correct function and dynamics of the binding domain, while still 
providing an excellent dynamic range. 


1. The sensor construct (SBP fused with the fluorescent protein) 
should be first cloned into an expression vector (with a T7 
promoter). The sequence to be cloned should match the 
sequence used to model the sensor exactly, with the exception 
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Fig. 2 A generic structure of an amino acid-binding protein (PDB 3IP9) used to illustrate that residue location 
and function needs to be considered along with the computational prediction. Residues 100 (b/ue) and 200 
(red) are shown. Although labeling residue 200 would produce a sensor with the theoretically largest dynamic 
range (Fig. 1), in reality, as this would require the dye to occupy the ligand-binding site between the two 
domains, it would not result in a functional sensor. Alternatively, residue 100 is also predicted to yield a sensor 
with a significant dynamic range and is located on a flexible loop, where chemical modification and mutagen- 
esis is less likely to negatively impact correct function of the binding protein 


that there can be a histidine tag at the C-terminus of the 
fluorescent protein to facilitate protein purification. 


2. Next, cysteine mutants of this sensor should be created at the 
residue locations identified by the computational screening 
(see Note 12). Any unwanted surface cysteines should be 
mutated to alternative residues to avoid nonspecific labeling 
(see Note 13). Although many cloning methods are suitable to 
introduce mutations, our preferred method is Gibson assembly. 


3. In order to create the cysteine mutants through Gibson assem- 
bly, first synthesize or order a set of complementary primers 
(forward and reverse primers encoding the same sequence) 
that encompasses the residue of interest, with the total length 
of the primer between 30 and 50 nucleotides. The nucleotides 
coding the residue of interest should be changed to encode a 
cysteine residue, all other residues should match the template 
DNA exactly. 


4. For the PCR, these primers will then be paired with the T7 
promoter/terminator primers for PCR amplification. The T7 
promoter forward primer is paired with the reverse mutagenic 
primer, while the T7 terminator reverse primer is paired with 
the mutagenic forward primer. 


3.3 Dye Labeling 
and Purification 


10. 


1. 
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. The PCRs using these primer sets should result in two fragments, 


with one fragment overlapping with the T7 promoter region 
and the mutagenic region, and the other fragment overlapping 
with the mutagenic region and the T7 terminator region. The 
mutagenic regions of these fragments will overlap and anneal 
during Gibson assembly. Gel purification is highly recom- 
mended to improve cloning efficiency, even if the agarose gel 
electrophoresis of the PCR product shows a single clear band. 


. Linear vector for use in the Gibson assembly reaction can be 


made through PCR using complementary primers of the T7 
regions. Specifically, a T7 terminator forward primer and a T7 
promoter reverse primer should be used. The linearized vector 
produced from this PCR reaction will have T7 regions that can 
overlap with the T7 regions of the gene fragments for the 
Gibson assembly reaction. 


. The PCR fragments (for both the sensor and vector) can then 


be combined into the Gibson master mix (in equimolar ratios). 
The typical incubation time for the master mix is 50 °C for 1h. 


. The Gibson assembly mixture should then be transformed into 


competent cells that are designed for DNA cloning (e.g., 
TOP10 cells). If electrocompetent cells are used, the Gibson 
mix can be diluted with water to prevent arcing. 


. After confirming that the cloning is successful through 


sequencing, the gene should be transformed into an expression 
cell type (e.g., BL21DE3). 


If the sensor construct contains a histidine tag, it can then be 
purified through nickel affinity chromatography. 


To a volume of 849 pL buffer add 100 pL of the concentrated 
protein solution (assuming the concentration of the protein 
solution is 500 pM) and 1 pL of the TCEP solution. Wait 
approximately for 5 min to allow this solution to equilibrate to 
room temperature and for any disulfides to be reduced. This 
reaction can be performed in an Eppendorf tube or an equiva- 
lent (see Note 14). 


. To the reaction mixture add 50 pL of the dye stock. The final 


composition of the reaction mixture should contain approxi- 
mately 50 pM of protein, 500 pM TCEP, and 500 uM dye. 
The reaction should be allowed to proceed overnight (16 h) at 
4 °C in the dark with constant agitation (see Note 15). 


. After the reaction period, centrifuge the reaction mixture at high 


speed (18,000 xy, 5 min) to separate out any precipitated dye 
or protein (see Note 16). 


. The mixture should then be purified through gel filtration, 


with a desalting column usually being sufficient (see Note 17). 
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4 Notes 


. Moles of dye permole protein = 


5. After one round of purification with the desalting columns, the 


protein mixture should be re-concentrated and_ buffer 
exchanged using centrifugal protein concentrators, which will 
remove trace TCEP and further remove unreacted and free 
dye. 


. The protein should then undergo a final desalting step to 


ensure that no free dye or TCEP remains and labeling effi- 
ciency can be evaluated using the following equation. 


ExC 


where for a given sample “A” is the absorbance at the peak 


oc 9) 


excitation wavelength of the dye in use, “e” is the extinction 
coefficient of the dye, and “C” is the concentration of protein. 


. Phosphate buffer might not be compatible with all proteins; 


substitute with an appropriate buffer if needed, but ensure that 
the pH range is compatible with the dye system in use. In the 
case of thiol-maleimide conjugations, the desired pH is 
between 7.0 and 7.5. 


. TCEP, or reducing agents in general, are typically only necessary 


for thiol based conjugation. TCEP is far preferable to other 
reducing agents such as DTT, as under normal conditions 
TCEP will not interfere with the maleimide-thiol reaction and 
also does not have issues with odor. 


. Sometimes there may be trace precipitate or undissolved dye in 


the stock solution. This is not a cause for concern so long as 
the precipitate is suspended homogenously prior to use. This 
trace precipitate will dissolve slowly over time as the reagent is 
consumed. 


. The protein should be purified to the highest degree possible, 


as other proteins could be labeled by the dye, which can affect 
the observed dynamic range of the protein sample. If additional 
purification is needed, size exclusion chromatography can be 
performed either before or after the labeling reaction. 


. For fluorescent proteins, this generally means mutating the fluo- 


rophore back to the three autocatalytic residues that form it; 
other proteins may incorporate nonstandard residues as a result 
of chemical modification for crystallization, etc. Unrecognized 
residues can be mutated to glycine by opening the .PDB file in 
a text editor, deleting the side chain atoms, and changing 
the residue names to GLY. They can then be mutated to the 
appropriate residue with PYMOL’s mutagenesis wizard. 


6. 


10. 
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We found this easiest to do by working out from both domains 
and allowing the constructed extended linker model to meet in 
the middle. Residues can be added to an existing model in 
PyMOL in Editing mode, accessed by clicking the mouse key 
shortcuts box in the lower right corner of the viewer window 
while in the default Viewing mode. Once in Editing mode, 
select the N-terminal nitrogen or C-terminal carbonyl carbon 
by clicking on it, then add residues by holding Alt and typing 
the single-letter code associated with the desired amino acid. 


. In PyYMOL’s Editing mode, select both atoms that should be 


bonded in the product, then run PyMOL’s fuse command. 
For example if a model with an ECFP on the N terminus of the 
protein is desired, start by selecting both the amine of the N 
terminus of the protein and the carbonyl carbon of the C ter- 
minus of the ECFP. Then, while both atoms remain selected, 
enter “fuse” as a command input, which will generate an 
approximate model of the fusion protein. The fuse command 
may sometimes orient the proteins poorly; make sure to rotate 
the proteins as such that they do not overlap in physical space 
and the linker is fully extended. This can be done in editing 
mode by holding shift and right-clicking on a bond to rotate 
the associated torsion angle. 


. For example, you may be working in a subfolder of your home 


directory called ~/dyes. You should create new directories for 
each conformation, perhaps ~/dyes /open and ~/dyes/closed. 
You add the script and starting structure to each directory, and 
are left with the following files: 
~/dyes/open/open-start.pdb 
~/dyes/open/simulations.sh 
~/dyes/closed/closed-start.pdb 
~/dyes/closed/simulations.sh 


. The script will then generate folders for setup, each run, and 


the trimmed, fitted trajectories as pdb files: 
~/dyes/open/setup/ 
~/dyes/open/runl/, ~/dyes/open/run2/, 
~/dayes/open/run3/ etc. 
~/dyes/open/results/ 


The version numbers given in the materials are known to 
work; other versions will probably work as well but have not 
been tested. Note that MARTINIZE.py, which is down- 
loaded and run by simulations.sh, is not compatible with 
Python 3, and thus Python 2 must be available on your sys- 
tem for the setup steps. The typical name for the DSSP exe- 
cutable varies from system to system; simulations.sh can be 
told the correct name for your system either by editing the 
dssp_name variable or by passing the correct name as an argument 
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11. 


with the —dssp_name switch. Finally, GROMACS versions 
prior to 5.0 are not supported. 


By default, simulations.sh prepares, equilibrates, and runs all 
simulations it is asked to without taking a break. However, if it 
is terminated, it can restart from the beginning of the last step 
it finished successfully, so it may be used (with some modifica- 
tion) as a resubmit script for systems like PBS. If preferred, the 
-o switch can be supplied to the script, which will perform the 
setup steps and generate individual run folders, but will not 
perform the full-length production runs. The full-length simu- 
lations can then be run on whatever hardware is appropriate by 
simply running gmx grompp on the provided .MDP files and 
starting structures. 

The script first prepares the MARTINI coarse-grained 
force field topology, and converts the provided .PDB file to a 
coarse-grained model with the script Martinize. This includes 
construction of an elastic network around both protein 
domains, which keep their conformations constant and allow 
sampling of linker collapse. It then energy minimizes and sol- 
vates the model, including addition of neutralizing ions, anti- 
freeze MARTINI water, and experimental salt concentration if 
desired. It performs a second energy minimization, and then 
equilibrates the system with thermostat and barostat in several 
rounds of progressively weaker backbone-restrained MD. Force 
constants of 1000, 500, 100, 50, and 10 are used by the script. 
The production run involves 30 replicate 200 ns simulations. 
New velocities are generated for each of these runs in a final 
unrestrained equilibration step. These simulations take approx- 
imately 90 min each for a 600 residue fusion model running 
on dual Tesla K40s with 32 cores. Finally, it outputs the last 
500 ns of each simulation, sampled in 1 ns intervals, as a PDB 
file to the results directory. 

For instance, if step 2 was performed in directories called 
~/dyes/open and ~/dyes/closed, and the script is being run 
in ~/dyes, per the example in Note 8, a typical call for a fusion 
model with the binding protein occupying residues 242-586 
and the fluorophore at residue 63 might be: 


python process-data.py open/results/*. 
pdb closed/results/*.pdb -f 63 -r 242-586 


12. While cysteine mutagenesis is a functional method of chemi- 


cally labeling a protein with a synthetic dye, alternative chem- 
istries are possible through the use of unnatural amino acid 
incorporation. This can allow for biorthogonal labeling of 
proteins in vivo, and in principle, it can allow for the sensor to 
be genetically encoded in an organism capable of utilizing the 
required unnatural amino acid. 


13. 


14. 


15. 


16. 


17. 
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There are two conserved cysteine residues in the GFP family; 
we have had success with the mutations C49S and C71V with 
minimal fluorescence loss, as described by Suzuki et al. [8]. 


The reaction volume and reagents should be scaled appropriately, 
relative to the concentration of the protein stock solution. 


When agitating the solution, care should be made that the 
solution does not begin to form froth or foam as this can lead 
to precipitation of protein. 


Filtration can be used instead of centrifugation; however, there is 
typically some volume/yield loss when filtering small volumes. 


PD-10 columns (GE healthcare) are usually sufficient to sepa- 
rate free dye from the protein. If the purity of the protein is a 
concern, the protein should be first buffer exchanged to 
remove excess TCEP and then purified with SEC (GE health- 
care, Hiload 26/600 superdex 200pg, adequate for most pro- 
teins). This should separate labeled protein from free dye and 


any contaminant proteins. 
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Chapter 7 


Rational Design and Applications of Semisynthetic 
Modular Biosensors: SNIFITs and LUCIDs 


Helen Farrants, Julien Hiblot, Rudolf Griss, and Kai Johnsson 


Abstract 


Biosensors are used in many fields to measure the concentration of analytes, both in a cellular context and 
in human samples for medical care. Here, we outline the design of two types of modular biosensors: 
SNAP-tag-based indicators with a Fluorescent Intramolecular Tether (SNIFITs) and LUCiferase-based 
Indicators of Drugs (LUCIDs). These semisynthetic biosensors quantitatively measure analyte concentra- 
tions in vitro and on cell surfaces by an intramolecular competitive mechanism. We provide an overview of 
how to design and apply SNIFITs and LUCIDs. 


Key words Biosensors, Protein engineering, Protein switches, Intramolecular ligands, Self-labeling 


proteins, SNAP-tag, CLIP-tag, Polyproline linkers, Therapeutic drug monitoring, Cell-surface 
biosensors 


1. Introduction 


1.1 SNIFITS Small molecule analytes are central to many biological systems. 
and LUCIDs: The ability to quantify such analytes is important in basic research 
Semisynthetic to better understand anabolic, metabolic, and signal relay pro- 
Modular Biosensors cesses. It is important to quantify small molecules not only in basic 


research, but also in healthcare, where it is crucial to be able to 
monitor the levels of drugs and metabolites for therapeutic pur- 
poses. The use of biosensors is one method for this quantification, 
and has many different applications [1, 2]. 

Several natural proteins bind small molecule analytes. If such a 
binding protein undergoes a conformational change upon analyte 
binding, it can be used as part of a biosensor system [3, 4]. For 
example, it can be sandwiched in between two fluorescent proteins 
that are Forster resonance energy transfer (FRET)-partners [5, 6] 
or between two bioluminescent resonance energy transfer (BRET)- 
partners [7]. In both cases, the change in distance and orientation 
between the two resonance energy transfer (RET) partners is trans- 
lated into changes in the optical properties of the system, which 
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can be used to give a quantitative output. However, many binding 
proteins do not undergo a sufficiently large conformational change 
when they bind to the analyte of interest. For these binding pro- 
teins, modular biosensors can be used to give larger artificial con- 
formational changes (see [8], and references therein). Our 
laboratory has introduced two such examples: SNAP-tag-based 
indicators with a Fluorescent Intramolecular Tether (SNIFITs) 
and LUCiferase-based Indicators of Drugs (LUCIDs). 

SNIFITs and LUCIDs are semisynthetic sensors composed of 
(1) a recombinant fusion protein and (2) a synthetic intramolecular 
tether (Fig. 1). The fusion protein is composed of a binding protein 
for an analyte of interest, a RET donor, and a self-labeling protein 
(SLP). The intramolecular tether is covalently attached to the SLP by 
a specific bio-orthogonal reaction, and contains a competitive ligand 
for the binding protein. In the absence of the analyte, the intramo- 
lecular ligand binds to the binding protein forming a “closed” state 
of the sensor. In the presence of the free analyte, the intramolecular 
ligand is displaced from the binding protein leading to an “open” 
state of the sensor. Since a RET donor is inserted into the recombi- 
nant fusion protein, and a RET acceptor is incorporated into the 
intramolecular tether, the conformation change can be translated 
into an optical readout. The equilibrium between the closed and the 
open states of the sensor can then be measured quantitatively. 


Intramolecular tether OO 
RET RET / 
acceptor — acceptor K 
oO 
Interest f 
@ 
——L- 
<_— 


RET 
donor 


Recombinant fusion protein 


RET donors: = 
Covalently attached Fluorescent Luciferase 
synthetic dye protein 


Fig. 1 The Architecture of SNIFITs and LUCIDs. An intramolecular tether (dashed line) containing a RET accep- 
tor and ligand for the binding protein is covalently attached to SNAP-tag, a self-labeling protein (SLP). SNAP- 
tag is fused to a RET donor and a binding protein. Displacing the intramolecular ligand by an analyte of interest 
leads to an overall change in the sensor geometry, and the RET efficiency between a RET donor and RET 
acceptor. The RET donor can be a synthetic dye covalently attached to a second orthogonal SLP, a fluorescent 
protein, or a luciferase 
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The intramolecular tether is covalently attached to the recom- 
binant fusion protein via an SLP. SLPs, such as SNAP-tag [9, 10], 
CLIP-tag [11], and Halo-tag [12], react selectively with various 
bio-orthogonal chemical groups for applications in imaging, as 
well as for purifying proteins and modifying protein function [13- 
15]. SNIFITs and LUCIDs use SNAP-tag, a derivative of the O*- 
alkylguanine-DNA alkyltransferase, which reacts specifically with 
O%-benzylguanine (BG) and chloropyrimidine (CP) derivatives 
(Fig. 2a). These moieties are incorporated at one end of the intra- 
molecular ligand. A second orthogonal SLP, such as CLIP-tag, can 
be used to attach a RET donor fluorophore to the recombinant 
fusion protein. CLIP-tag is a further development of the O°- 
alkylguanine-DNA alkyltransferase that reacts specifically with O?- 
benzylcytosine (BC) (Fig. 2b). 

The optical readouts of SNIFITs and LUCIDS are FRET and 
BRET, respectively. FRET is generally preferred for cell imaging. 
The FRET donor can be either a fluorescent protein or a small- 
molecule synthetic dye, where synthetic dyes have better quantum 
yields, less photo-bleaching, and narrower excitation/emission 
spectra, than fluorescent proteins [16, 17]. For applications on cell 
surfaces, cell-impermeable synthetic dyes should be used, as 
cell-permeable dyes will also label the sensor in the secretory path- 
way. BRET is preferred when bulk measurements are made in 
complex biological mixtures such as serum. Luciferases generate 
light upon reacting with a chemical substrate [18, 19] bypassing 
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Fig. 2 Mechanisms of SLPs used in the work of SNIFITs and LUCIDs. SNAP-tag reacts selectively with 


O®-benzylguanine moieties 


while CLIP-tag reacts with 0*-benzylcytosine (BC). The SLPs are used to 


covalently attach the intramolecular tether and sometimes a fluorescent molecule (here annotated as R), 


to the fusion protein 
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the need to excite the donor fluorophore. This eliminates prob- 
lems with the autofluorescence of other molecules in the samples, 
and eliminates the photo-toxicity and bleaching effects that may 
take place when using FRET. Nevertheless, the photon count 
emitted from the luciferase reaction is lower than that observed 
when a fluorophore is excited, which makes FRET the readout 
method of choice for applications in cell biology. 

To quantify the response of the sensor to an analyte of interest, 
the light intensity is measured at the emission wavelengths of both 
RET partners (after excitation of the RET donor for SNIFITs or 
after the addition of the luciferase substrate for LUCIDs). The 
ratio of these two light intensities (Laonor/Jncceptor) 18 plotted against 
the free analyte concentration (Fig. 3). At low analyte concentra- 
tions, the sensor is mostly in the closed state and the ratio is low 
(lower plateau). At high analyte concentrations, the equilibrium is 
shifted to the open state of the sensor, and the ratio is high (upper 
plateau). The sensor’s dynamic range refers to the change in signal 
magnitude in the absence of analyte compared to under saturating 
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Fig. 3 A typical titration curve from a RET-based biosensor. The sensor is described by the lowest and highest 
possible ratios of the RET donor and RET acceptor emission ratios (lower and upper plateaus), the dynamic 
range and the Cs 


1.2 The Rational 
Design of SNIFIT 
and LUCID Biosensors 


1.2.1 The Binding 
Protein 


7.2.2 The 
Intramolecular Tether 
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concentrations of analyte, and is calculated as the ratio between the 
upper and the lower plateaus. The concentration of analyte that 
causes half the maximum signal change is defined as the cso. The ¢s9 
for a certain analyte can be tuned by altering the affinity of the 
intramolecular ligand for the binding protein. The analyte response 
range can thus be altered to fall within physiological concentra- 
tions of the analyte. 


Some common design principles can be followed to construct a 
SNIEFIT or a LUCID for a specific analyte. Over the years, we have 
gathered experience from our design of biosensors from analytes 
ranging from sulfonamides and neurotransmitters to cancer thera- 
peutics such as methotrexate. Here, we give an overview of how to 
design these biosensors, followed by an overview of their practical 
applications. 


A binding protein is required to display sufficient affinity and speci- 
ficity for the analyte of interest. Since SNIFITs and LUCIDs are 
based on competition with another ligand, the dissociation con- 
stant of the binding protein for the analyte of interest must be 
lower compared to the desired response range. 

Another important factor when choosing the binding protein 
is the availability of structural information, ideally in complex with 
a potential intramolecular ligand. This is not only crucial for the 
geometrical optimization of the sensor, but also for the choice of 
derivatization points of the ligand. In this regard, it can be advan- 
tageous if the binding protein is small, monomeric, and stable. Yet, 
if no suitable natural binding protein is available for the analyte of 
interest, computational methods can be used to design a protein 
with tailor-engineered binding properties [20]. 


The synthetic component of SNIFITs and LUCIDs consists of 
three parts: (1) a BG group that serves as the attachment site for 
SNAP-tag, (2) a fluorophore as RET acceptor, and (3) an intramo- 
lecular ligand (Fig. 4). These three parts are connected by oli- 
goethylene glycol (EG) linkers that must be sufficiently long not to 
affect the binding of the intramolecular ligand and the closing of 
the sensor. We routinely use an (EG); linker between the BG 
group and the fluorophore, and an (EG), linker between the fluo- 
rophore and the intramolecular ligand. 

Three main points must be considered when designing the 
intramolecular ligand of the sensor: (1) its affinity must be strong 
enough for the sensor to be predominantly closed in the absence 
of analyte, (2) its affinity determines the sensor’s analyte response 
range, and (3) its dissociation kinetics determine the response 
kinetics of the sensor. 
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Fig. 4 Example of an intramolecular tether. This intramolecular tether is composed of the reactive group for the 
SLP (0°- benzylguanine), a fluorophore (Cy3) and a ligand for the binding protein (methotrexate) 
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Fig. 5 Sensor’s equilibria. The affinity of the intramolecular ligand determines the equilibrium between the 
closed and open states of the sensor. In the presence of free analyte [A], a second equilibrium is established 
in which there is competitive binding of the free analyte to the binding protein 
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The response of the sensor can be described in two steps: 
unbinding of the intramolecular ligand (q, Eq. (1)) followed by 
binding of the analyte (XG, Eq. (2)) (Fig. 5). Equation (1) shows 
that the ratio between the closed (S(cjosea)) ANd OPEN (Siopeny) State of 
the sensor in the absence of analyte is directly proportional to the 
affinity of the intramolecular ligand. Equation (3) describes the 
relationship between the sensor’s apparent affinity for the analyte 
(Kor) and the analyte’s affinity for the receptor protein (Kd,anatyre) 
(see [21] for a more detailed discussion). 

The effective molarity (M.j) of the intramolecular ligand in 
SNIFIT and LUCID sensors is in the order of 100 pM [22]. The 


1.2.3 Sensor Geometry 
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intramolecular ligand should therefore have a Kd of 10 pM or 
lower, so that the closed state of the sensor is favored over the open 
state by at least a factor of 10 when the analyte is not present. Since 
the analyte competes with the intramolecular ligand for binding to 
the same site, the apparent affinity of the sensor for the analyte is 
inversely proportional to its affinity for the intramolecular ligand. 
In this way, the response range of the sensor can be tuned to the 
desired concentration. However, if the intramolecular ligand’s Kd 
value is very low, its unbinding kinetics tend to be slow, which 
limits the sensor’s temporal resolution. 


To achieve a large dynamic range, the RET efficiency should be 
high in the closed state and low in the open state of the sensor. 
Since the RET efficiency depends strongly on distance, this can be 
achieved by modifying the sensor geometry. The modular architec- 
ture of SNIFITs and LUCIDs allows to control the distance 
between the RET partners in the closed (see D1 in Fig. 6) and open 
states (see D2 in Fig. 6) independently of each other. 

It may be difficult to obtain close proximity between the RET 
donor and RET acceptor in the closed state of the sensor (i.e., aim- 
ing to minimize D1 in Fig. 6) which strongly depends on the 
structural characteristics of a binding protein that is available for a 
given analyte. If the terminus of the binding protein through 
which it is fused to the RET donor is close to the analyte binding 
site, the distance D1 will be small and the construct does not have 
to be optimized further by protein engineering. Yet, if neither of 
the two termini are suitable, it is sometimes possible to circular 
permute the binding protein to create new termini closer to the 
binding site of the analyte (Fig. 7). Circular permutation of the 
binding protein may, however, be cumbersome, and may affect the 
properties of the binding protein. 


Fig. 6 Sensor geometry optimization. D1 refers to the distance in space between the RET donor and acceptor 
in the closed state, and can be optimized by changing the linker length L1. L1 refers to the distance along the 
polypeptide chain covered by parts of the synthetic tether, the ligand-binding domain and the connecting linker 
(highlighted in b/ue). Similarly, D2 refers to the distance in space in between the RET donor and acceptor in the 
open state and can be optimized by increasing L2. L2 refers to the distance along the polypeptide chain cov- 
ered by parts of the synthetic tether, the SNAP tag, and the connecting linker (highlighted in orange) 
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2 Materials 


2.1 Bacterial 
Expression 

and Purification 

of SNIFITs and LUCIDs 


Fig. 7 An example of the binding protein dihydrofolate reductase from E. col with 
bound ligand. The crystal structure of eDHFR (PDB ID: 1RA3) is represented in 
gray cartoon, the methotrexate analyte is represented in gray sticks. The site of 
derivatization for the intramolecular ligand is highlighted by an arrow. The sites 
for circular permutation are shown as dashed lines 


The distance D2 between the RET donor and RET acceptor in 
the open state of the sensor can be controlled by modifying the 
linker L2 which connects the SNAP-tag and the RET donor. Since 
simple GGS linkers are conformationally flexible, they are not opti- 
mal as spacers [23]. Instead, polyproline linkers can be used, as 
they form rigid helical structures [24]. In our experience, a 
30-proline linker is sufficient to reduce RET efficiency in the open 
state [25]. 

A list of all SNIFIT and LUCID sensors developed to date is 
given (Table 1) while the following protocol provides a step-by- 
step summary how to construct and apply individual sensors. 


1. RosettaGami DE3 pLysS electrocompetent E. coli cell. 


2. LB medium: 5 g/L NaCl, 10 g/L tryptone, 5 g/L yeast 
extract pH 7.5. 


3.100 mg/mL ampicillin stock solution (1000x), 
filter-sterilized. 


4. LB-agar plates supplemented with 100 pg/mL ampicillin. 


5.1 M_ isopropyl-B-thiogalactoside stock (IPTG) (1000x), 
filter-sterilized. 


6. Sonicator or French press. 


109 


SNIFITS and LUCIDs: Semi-Synthetic Modular Biosensors 


oNINRIALL 
WU 7 %S8F Lad duOIDIsaSOI -IVNS-OfddIN-€ OIDIA aiont [og] UIXOSIG 
ONINRAILL 
wi ez %16S Lada oprueuoynsereg WOH IN-0€dd-dVNS aqiont [og] = (Wa) aaewrestdoy, 
SD6FTU 
onIN-¢A9 uly doyaA9 ueumyds (WU) 
WU" 00S %T6L Laud von paxt|y -ONIN-0€dd-dYNS aint [og] ¥ autrodsopsAcQy 
onIN-€49 Td (Wu) snunjoms 
WU ZT %09F LAU = pues] poproy-omy, ON IN-0€dd-dVNS qiont [oe] pur snumporoey, 
oNIN-¢49 oyexo.QoyIoul SOFT TAH ads (nu-yjyu) 
Wl sg-sz'0 %OFET Lada /vadoy ow], -ONIN-0€dd-dVNS qiont [og] aTexoNOYIA 
eA9-Gh9 aouyans (Ww-WU) 
akp-akq ¢[D-wumrmuoydoips 10 dVNS ][9> pur ovgea ur sIOUqIyUl 
Ww OI-WY Z %S9 Lada umruompauessqd -§YODd5D)-dI1O-AYOV LISINS [67]  ayOV pure yoV 
88PEXTV-SAD 
adp-Ih] €8ZIs (ww) 
Suistueysout susoddg IDO Istuosejyue possoidxs09 7gqy aoryans [[95 (WAVD) poe 
W1 00T (S192) %08 Lada joidaoa1 "WV puke eT G¢D-dITO-dVNS LISINS = [SZ] etArnqouruny-A 
L¥SKA-SAD aoepans 
(sqI29) %9S ak p-ahT TSTS-SUNIDE —JP9_ pure oz wr (nd 
Ww" ZI (42) %06 Law a7eUTeIN]D) -dI10-S$ldd-dVNS LISINS [ZZ] —-ysry) azeureinty 
L¥SKA-SAD 
akp-ahC. 
Array Qur-gAQ WOH -dITO-dd-dVNS aoepans (WU) 
(s[[2>) %0E7  u2a20.0d guaasasonyf- ap VOH-dITO-dVNS = []99 puronmur = [gz +UZ, pur (Wt) 
W1 009-077 ~— (42) %O6E Lada saprueuoyng = WOH -AroyOul-qYNS LISINS ‘C¢] = ssouqryut WOH 
on) afues wsiueysa|y puebly ubisaq asn ayAjeuy 
oiwmeukg Jejnoajowejul 


a}eP 0} paysiiqnd siosuas QI9NT pue LI4INS Jo Asewwing 


} a1qeL 


110 Helen Farrants et al. 


2.2 In Vitro Labeling 
and Characterization 


2.3 Cell-Surface 
Expression 


. Ni?*-NTA agarose slurry. 


. Propylene column. 


9. Ni**-NTA lysis buffer: 50 mM KH,PO,, pH 8, 150 mM NaCl, 


aon FW HN 


5 mM imidazole. 


. Phenylmethylsulfony! fluoride (PMSF): 100 mM in ethanol. 
. Lysozyme: 10 mg/mL in water. 
. Ni?*-NTA wash buffer: 50 mM KH,PO,, pH 7.5, 150 mM 


NaCl, 10 mM imidazole. 


. Ni?*-NTA elution buffer: 50 mM KH,PO,, pH 7.5, 150 mM 


NaCl, 500 mM imidazole. 


. Bradford reagent. 


. Streptavidin-resin in propylene column, such as the Strep- 


Tactin column. 


. Streptavidin wash buffer: 100 mM Tris-HCl, pH 8, 150 mM 


NaCl, | mM EDTA. 


. Streptavidin elution buffer 100 mM Tris-HCl, pH 8, 150 mM 


NaCl, 1 mM EDTA, 2.5 mM desthiobiotin. 


. Centrifugal filter device with appropriate molecular weight 


cutoff. 


. Storage buffer: 50 mM HEPES, 50 mM NaCl, pH 7.2. 


. Labeling buffer: 50 mM HEPES, 50 mM NaCl, pH 7.2, 


1 mg/mL bovine serum albumin (BSA). 


. Fluorescein or tetramethylrhodamine (TMR) labeled ligand 


stock solution in DMSO, for characterizing the binding 
strength by means of fluorescence polarization. 


. BG-fluorophore-ligand conjugate stock solution in DMSO, for 


labeling SNAP-tag based sensors with an intramolecular tether. 


. BC-fluorophore-ligand conjugate stock solution in DMSO, for 


labeling CLIP-tag-based sensors with an intramolecular tether. 


. Nonbinding multiwell plates (black or white). 


. Plate reader with filters for fluorescent polarization and fluo- 


rescent measurements. 


. Nano-glo® substrate from Promega. 


. Glass coverslips (@ 15 mm). 

. Ethanol. 

. Poly-L-lysine solution: 0.7 mg/mL in sterile water. 
. HEK293 cells. 

. Lipofectamine 2000. 

. Opti-MEM® I reduced serum medium. 


3 Methods 


3.1 Generation 
of Fusion Proteins 
for Expression in E. 
coli 
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7. Dulbecco’s Modified Eagle Medium (DMEM) Glutamax 
medium supplemented with 10% fetal bovine serum. 


8. Hank’s Balanced Salt Solution (HBSS) supplemented with 
10 mg/mL of BSA. 


Standard methods of cloning can be used to construct the expression 
plasmids for the recombinant fusion proteins. We initially utilized 
restriction-enzyme cloning and ligation methods, but have recently 
started to use newer ligation methods such as Gibson assembly [26]. 

We advise using a pET51b(+) (Novagen) expression vector 
that contains a double-purification system of a His-tag and a Strep- 
tag. Other purification systems can be used, but it is important to 
use affinity purification tags on both the N- and C-termini of the 
protein to prevent purifying truncated versions of the sensor. We 
provide here a general protocol for the expression of sensors, but 
modification of the method may be necessary if the binding pro- 
tein is unstable or insoluble. 


1. Clone the sensor into a bacterial expression vector using stan- 
dard cloning methods (see Note 1). 


2. Transform RosettaGami DE3 pLysS E£. coli cells for protein 
production, plate cells onto selective LB agar medium, and let 
colonies grow at 37 °C overnight. 


3. Pick a single colony and use it to inoculate 2 mL of selective 
LB medium and allow the bacteria to grow overnight, with 
shaking at 220 rpm at 37 °C. 


4. Inoculate 1 L selective LB medium with 1 mL of the precul- 
ture and incubate (220 rpm at 37 °C) until an OD 60am Of 0.6- 
0-8 is reached. 


5. Cool the incubator to the expression temperature and induce 
the expression of the protein depending on the promoter used 
(see Note 2). 

6. After expression, harvest the cells by centrifugation and resus- 
pend the bacteria in a total of 30 mL of Ni’?*-NTA lysis buffer 
supplemented with 1 mM PMSF and 0.25 mg/mL of 
lysozyme. 

7. Lyse cells by sonication on ice. 

8. Centrifuge at 40,000 x g for 10 min at 4 °C to remove cell 
debris and insoluble proteins. 

9. Incubate the supernatant with 1 mL Ni?*-NTA agarose slurry 
under agitation for 1 h at 4 °C. Then centrifuge gently and 
transfer the resin onto a propylene column. 
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3.2 Chemical 
Synthesis 

of the Intramolecular 
Tether 


3.3 Determining 
the Affinity 

of the Intramolecular 
Ligand 


10. Wash the Ni**-NTA agarose with at least six column volumes 
of cold Ni**-NTA wash buffer. 


11. Elute the protein with 500 wL aliquots of Ni?*-NTA elution 
buffer. Estimate the protein concentration of each fraction 
using Bradford Reagent and/or absorbance at 280 nm. 


12. Take aliquots of the eluate of the Ni**-NTA column to follow and 
analyze the purification process by SDS-PAGE (see Note 3). 


13. Load all of the eluted fractions that contain protein onto a 
StrepTactin-column. 


14. Wash the column using the Strep-tag washing buffer. Wash 


with six column volumes. 


15. Elute the protein using 500 pL fractions of Strep-tag elution 
buffer (for a 1 mL column). Estimate the protein concentra- 
tion of each fraction using a Bradford quantification and/or 
absorbance at 280 nm. 


16. Take aliquots of each step and run an SDS-PAGE to verify the 
purity of the protein. 


17. Concentrate the protein using molecular-weight dependent 
filtration devices, with the recommendation of the provider, 
and change the buffer to storage buffer. Dilute the sample in a 
1:1 ratio with glycerol 87% and store it at -20 °C at a concen- 
tration between 20 and 200 uM. 


The synthesis of the intramolecular tether heavily depends on the 
chemical properties of the intramolecular ligand. In one approach, 
each component of the intramolecular tether (i.e., the BG-reactive 
group, fluorophore, and intramolecular ligand) can be synthesized 
separately in a modular approach and the tether subsequently 
assembled using standard chemical methods such as amide bond 
formation, click chemistry, and alkylation. For examples, see [22, 
25, 27-30]. When making the intramolecular tether, it is worth 
making also a ligand that only carries a fluorophore and can subse- 
quently be used to measure the K, by means of fluorescence polar- 
ization measurements. 


Many methods can be used to determine the affinity of the recep- 
tor protein for both the intramolecular ligand and the free analyte. 
We routinely use fluorescence polarization to determine both of 
these parameters. 


1. Prepare serial dilutions of the receptor protein in labeling buf- 
fer that contains 20 nM fluorescein or tetramethylrhodamine 
(TMR)-labeled ligand. 


2. Pipette 100 pL of each dilution into the wells of a black 96-well 
plate. 


3. Incubate the plate for 10 min at room temperature. 


3.4 In Vitro Labeling 


1. 
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. Record the fluorescence polarization signal using a suitable 


plate reader. 


. Plot the fluorescence polarization response against the protein 


concentration, and fit the curve to a single-site binding iso- 
therm to determine the Kd of the receptor protein for the 
derivatized ligand: 


Fe. — Fy 
F — F ; n max min 
min K 


d, fluorescent _ ligand 


[ protein | 


. Dilute the protein to a concentration close to the Kd for the 


labeled ligand in the same labeling buffer with 20 nM fluores- 
cein- or TMR-labeled ligand; then add serial dilutions of the 
free ligand analyte. 


. Record the fluorescence polarization signal using a suitable 


plate reader. 


. Determine the concentration of unbound protein at each ana- 


lyte concentration from the equation of the single-site binding 
isotherm (using the values for TEM fisesreatects itis Fins and Fax 
determined above): 

(E 


min 


= F) 
d, fluorescent _ ligand F a EF 


[ protein,... ] =K 


max 


. Plot the concentration of free protein against the analyte con- 


centration and fit the curve to a single-site binding isotherm to 
determine the Kd of the receptor protein for the free analyte. 


Label the recombinant fusion protein with synthetic mole- 
cules. Common concentrations are 1 4M sensor protein with 
2 uM BG-based substrate for SNAP-tag, and 10 pM BC-based 
substrate for CLIP-tag in labeling buffer (see Note 4). 


. For SNIFITs, remove the excess labeling reagents using three 


washes with labeling buffer in centrifugal filtering devices, or using 
a small gel filtration column. This is not needed for LUCIDs. 


. Quantify the amount of labeled protein by estimating the con- 


centration of dye from its specific absorbance and comparing it 
to the protein absorbance at 280 using a spectrophotometer, 
such as the nanodrop. 


. The quantitative labeling of the sensor, assuming fully func- 


tional SLPs, can also be verified by SDS-PAGE and in-gel fluo- 
rescence scanning. To this end, incubate the labeled sensor 
with an additional fivefold excess of a BG- or BC-based deriva- 
tive labeled with a second fluorophore for 30 min at room 
temperature, and then separate the samples by SDS-PAGE. 
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3.5 In Vitro Titrations 


3.6 Application 
of SNIFITs and LUCIDs 
on the Cell Surface 


1. 


Prepare a suitable dilution of the sensor in labeling buffer and 
add 98 wL into each well of a 96-well plate. Use nonbinding 
black 96-well plates for fluorescence and white plates for lumi- 
nescence measurements. We find that 50 nM sensor is suitable 
for SNIFITs and 5 nM for LUCIDs. 


. Prepare serial dilutions of the analyte of interest in DMSO, and 


add 2 uL of the dilutions to each well with the sensor. 


. Incubate for 1 h at room temperature, or until the sensor is 


fully open (see Note 5). 


. Record spectra on a spectrophotometer exciting the FRET 


donor, or add luciferase substrate to give an adequate concen- 
tration. We usually use a 400-fold dilution in storage buffer of 
Nano-glo® substrate from Promega stock with LUCIDs. 


. Plot the intensity ratio (R) between the RET donor and RET 


acceptor against the sensor concentration, and fit it to a com- 
petitive single binding site isotherm. 


R = 7 donor 
acceptor 


R= 
R = R + sat Begs 


£50 


]+——__ 
[ Analyte] 


. Clone the sensor of interest into a pDisplay mammalian expres- 


sion vector using standard cloning protocols. We usually use 
transient transfection for sensor evaluation on cell surfaces, but 
semistable cell lines can also be prepared. 


. Sterilize glass coverslips (@ 15 mm) by dipping them into an 


ethanol bath and briefly holding them in a Bunsen burner 
flame. 


. Place the sterile coverslips into 12-well cell culture plates. 


. Coat the coverslips by adding 1 mL poly-L-lysine solution 


(0.7 mg/mL in sterile water) per well. Incubate the plate over- 
night at 37 °C. 


. Remove the poly-l-lysine and wash the cover slips once with 


1 mL sterile water. 


. Seed HEK293 cells to around 30% confluency in DMEM 


Glutamax medium supplemented with 10% fetal bovine serum, 
and grow at 37 °C and 5% CO, in a humidified incubator. 


. The next day, transfect the cells with the plasmid using a trans- 


fection agent of choice. We routinely use Lipofectamine 2000. 
Dilute the target DNA to a final concentration of 0.8 pg/mL 
in Opti- MEM® [reduced serum medium and add Lipofectamine 
2000 to a final concentration of 1 pL/mL. Allow to stand at 
room temperature for 20 min to form a pre-complex. Dilute a 


4 Notes 


10. 


11. 


12. 
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further tenfold and add 100 pL of the Lipofectamine-DNA 
mixture to each well and allow to stand for 3-6 h at 37 °C and 
5% CO3. 


. Optional step: Exchange the medium after 4-6 h to remove 


the transfection reagent. 


. 24-48 h after transfection, label the sensor by replacing the 


medium with HBSS supplemented with 10 mg/mL BSA and 
2 uM BG-based and 10 pM BC-based fluorophore-ligand 
label. In order to avoid using large amounts of the synthetic 
compounds, the glass coverslips can be transferred from the 
12-well plate onto a sheet of parafilm and 100 pL of the label- 
ing solution carefully pipetted onto the edge of the coverslip. 


After 10 min of labeling at room temperature, aspirate the 
labeling solution and wash the glass coverslip four times with 
HBSS. 


For perfusion experiments, transfer the coverslip into a flow 
cell. We use a Warner imaging chamber RC-20 and gravity-fed 
perfusion at a flow rate of 0.5 mL/min. Ifno perfusion system 
is available, the analyte can be added to the cells by pipetting 
(see Note 6). 


Image the sensor using a standard wide-field or confocal fluo- 
rescence microscope. 


. The polyproline region can be difficult to clone, as it is highly 


repetitive. We have cloned the region by annealing 5’-phos- 
phrylated oligonucleotides and ligating them into a restriction- 
digested plasmid. 


. We use pET vector expression systems that are induced by 


IPTG (0.5-1 mM final concentration) and expressed at 16 °C 
overnight. 


. You can follow the sensor in the different fractions by labeling 


it with a fluorescent substrate and doing an in-gel fluorescent 
scan. 


. We recommend keeping the DMSO concentration below 5%, 


as higher concentrations may denature the protein. The dura- 
tion of labeling depends on the intramolecular ligand, but we 
find that the reaction is finished after 1 h. 


. The kinetics of opening may differ between sensors, depend- 


ing on the affinity of the intermolecular tether for the binding 
protein. 


. We find that measuring the half-life Of fopening AN foosing ON Cell 


surfaces is often limited by the perfusion speed of the perfusion 
chamber. 
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Chapter 8 


Ultrasensitive Firefly Luminescent Intermediate-Based 
Protein-Protein Interaction Assay (FlimPIA) Based 

on the Functional Complementation of Mutant Firefly 
Luciferases 


Yuki Ohmuro-Matsuyama and Hiroshi Ueda 


Abstract 


We recently developed a protein-protein interaction assay, FlimPIA (Firefly luminescent intermediate-based 
Protein-protein Interaction Assay) based on the catalytic mechanism of firefly luciferase (Fluc) that can be 
divided into two half-reactions: the adenylation step and the oxidative luminescent steps. We engineered 
two mutant Fluc enzymes named “Donor” and “Acceptor” where the oxidative luminescent activity of the 
Donor is almost eliminated and the adenylation activity of the Acceptor is suppressed. When the Donor 
and the Acceptor are each fused to one of two interacting partners, and put together to interact, the Donor 
and the Acceptor come sufficiently close such that the Acceptor can react with the luciferyl-adenylate 
intermediate (LH2-AMP) produced by the Donor, and thus emit luminescence. 

FlimPIA can be used in vitro and in cultured cells. Owing to recent improvements, it has several 
advantages in terms of signal/background ratio, detectable size of interacting partner, and sensitivity over 
conventional protein-protein interaction assays based on Foérster/fluorescence resonance energy transfer 
and protein-fragment complementation performed in vitro. Here, we describe a protocol to make use of 
the latest version of FlimPIA which shows even lower background and higher signal than previously 
described ones. 


Key words Protein-protein interaction, Firefly luciferase, Substrate channeling, Reaction intermedi- 
ate, Sensitivity, Detectable distance, Signal-background ratio 


1. Introduction 


Development of protein-protein interaction (PPI) assays is consid- 
ered highly important in various fields, such as molecular and 
cellular biology, diagnostics, and drug screening. Recently, we 
developed a novel protein-protein interaction assay that we termed 
FlimPIA (Firefly luminescent intermediate-based Protein-protein 
Interaction Assay) based on the following unique detection 
principle [1]. 


Viktor Stein (ed.), Synthetic Protein Switches: Methods and Protocols, Methods in Molecular Biology, vol. 1596, 
DOI 10.1007/978-1-4939-6940-1_8, © Springer Science+Business Media LLC 2017 
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The method is based on the multiple (two half) reactions 
catalyzed by firefly luciferase (Fluc). Fluc is frequently applied as a 
reporter enzyme in biological assays because of its sensitivity, 
dynamic range, rapidity, and easy-to-handle measurement. In Fluc- 
catalyzed reactions, firefly p-luciferin (LH;) is converted to lumi- 
nescent oxyluciferin (OxL). This reaction can be subdivided into 
two half-reactions, namely the adenylation step and the oxidative 
luminescent steps (Fig. 1a). In the former step, a luciferyl-adenylate 
intermediate (LH,-AMP) is produced from LH, and ATP, and in 
the latter OxL is produced via proton abstraction at the C4 carbon 
of LH,-AMP. We made two mutant Flucs named the “Donor” and 
the “Acceptor.” The Donor is a triple mutant H245D/K443A/ 
L530R based on the most commonly used Photinus pyralis residue 
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Fig. 1 Principle of FlimPIA (a) Chemical reactions catalyzed by Fluc. Fluc produces 
excited state oxyluciferin (OxL) from p-luciferin (LH.) by a two-step catalysis, the 
adenylation step, and the following oxidative steps. (b) Working mechanism of 
FlimPIA. The adenylation acitivity of the Donor is suppressed, on the other hand, 
the oxidative activity of the Acceptor is suppressed. Donor and Acceptor are 
enough close that the Acceptor use luciferyl-AMP (LH,-AMP) produced by Donor 
when binding domains are interacting. Adapted with permission from Fig. 1c in 
Ref. 2. Copyright John Wiley and Sons 
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number. For this mutant, the oxidative luminescent reaction is syn- 
ergistically suppressed by the H245D/K443A mutations while the 
adenylation reaction is enhanced by the L530R mutation. As a 
result, the Donor can efficiently produce LH,-AMP, but can only 
poorly produce OxL. On the other hand, the Acceptor that origi- 
nally featured a K529Q mutation had a lower adenylation activity 
while maintaining full oxidative luminescent activity. Thus, the 
Acceptor cannot produce LH,-AMP? efficiently on its own. 

To perform the interaction assay, the Donor and the Acceptor 
are each fused to one of two interacting proteins of interest, respec- 
tively. When they do not interact with each other, the Donor and the 
Acceptor are far apart, and the luminescence intensity remains low. 
However, when they are brought into close proximity by the inter- 
acting protein pair, the Acceptor can react with LH,-AMP that was 
produced by the Donor. As a result, the luminescence that is emitted 
is increased (Fig. 1b). The splitting of two half reactions is made 
possible partly due to the conformational change of the Fluc enzyme, 
which accompanies rotation of the C-terminal domain. 

So far, several attempts to improve the PPI assay FlimPIA have 
been achieved, and several advantages over conventional PPI assays 
have been discovered [1-3]. First, FlimPIA generally works well 
in vitro as well as in cultured cells. Second, compared to conven- 
tional Fluc-based PPI assays based on protein-fragment comple- 
mentation assays (PCA), the FlimPIA probes are more thermostable 
and reproducibly give higher luminescence. Third, the distance 
that can be detected between the probes is longer than the distance 
between the probes for Forster/fluorescence resonance energy 
transfer (FRET)-based PPI assays and PCA. In practice, this means 
that larger interacting proteins can be detected by FlimPIA. Fourth, 
the sensitivity is approximately tenfold higher than Fluc-based 
PCA both in terms of the detection limit and signal intensity 
in vitro. Finally, using a recently discovered double mutant R437K/ 
K529Q as a new Acceptor significantly improved signal /background 
(S/B) ratio as well as the signal intensity under optimized assay 
conditions. The mutation of residue R437 which is located in the 
hinge region connecting the N-terminal and C-terminal domains 
was found to selectively lower adenylation activity. The obtained 
maximum S/B ratio (> 40) is higher than that obtained with Fluc 
PCA under the same experimental reaction conditions. In this 
chapter, a protocol to generate and assay this latest version of 
FlimPIA is described [2]. 


Prepare all solutions using ultrapure water (made by purifying 
deionized water to attain resistivity of 18 MQ cm at 25 °C). For all 
the reagents, use them of the highest grade available. 
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1. 


100 mM ATP stock solution made by adjusting pH with 
sodium hydroxide (NaOH) to 7.2, and stored at —80 °C. 


.20 mM Firefly p-luciferin (LH2) (available from Biosynth, 


Staad, Switzerland) stock solution made by adjusting pH by 
NaOH to 7.3, and stored at —20 °C. 


. QuikChange Site-Directed Mutagenesis kit and/or QuikChange 


Multi Site-Directed Mutagenesis kit (Agilent Technologies, 
Santa Clara, CA). 


4. In-Fusion HD cloning kit (Takara-Bio, Otsu, Shiga, Japan). 


. E. coli JM109 competent cells. 


6. LB agar: 5 g/L yeast extract, 10 g/L tryptone, 10 g/L NaCl 


20. 


21. 


and 15 g/L agar. 


. LB liquid medium: 5 g/L yeast extract, 10 g/L tryptone, and 
10 g/L NaCl. 
. LBA agar plate: LB agar supplemented with 100 pg/mL 


ampicillin. 


. LBA liquid medium: LB liquid medium supplemented with 


100 pg/mL ampicillin. 


. E. coli SHuffle T7 express lysY competent cells (New England 


Biolabs Japan, Tokyo, Japan). 


. 10 mM Rapamycin (e.g., LKT Laboratories, St. Paul, MN) 


stock solution dissolved in methanol or dimethyl sulfoxide, 
stored at —20 °C. 


. 1 M Isopropyl-£-thiogalactopyranoside (IPTG) stock solution 


for inducing protein expression. 


. White half area 96-well plate (Costar 3694, Corning-Costar, 


Osaka, Japan). 


. Aplasmid encoding Photinus pyralis Fluc cDNA such as pGEX- 


Ppy or pGEM-luc (Promega, Madison, WI). 


. pET32b (Novagen, Merck, Darmstadt, Germany). 
. TALON metal affinity resin and TALON disposable gravity 


column (Clontech, Takara-Bio). 


. Toyopearl HW-40S gel filtration resin (Tosoh, Tokyo, Japan). 


. Synthetic genes coding for binding domains such as E. coli 


codon-optimized human FK506-binding protein 12 (FKBP12) 
and FKBP-rapamycin-associated protein (FRB), appended with 
Neol/Sfil and NotI sites at the 5’ and 3’ ends, respectively. 


. TALON extraction buffer: 300 mM sodium chloride (NaCl), 


50 mM sodium phosphate, adjusted to pH 7.0. 
TALON elution buffer: 150 mM of imidazole in TALON 
extraction buffer. 


MOPS buffer: 100 mM MOPS, 10 mM MgSOu, pH 7.3 
(see Notes 1 and 2). 


3 Methods 


3.1 Construction 
of the Binding 
Domain-Fused 
Expression Vectors 
(Fig. 2) 


3.2 Construction 
of BD1/Donor 
Expression Vector 
(Fig. 2a) 


22. 


23. 


24. 


25. 


26. 
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20 mM coenzyme A trilithium salt stock solution dissolved in 
water, and stored at —20 °C. 


For the determination of enzyme kinetics, use a multiwell 
luminometer such as AB-2350 equipped with a sample injector 
(ATTO, Tokyo, Japan). 

For the calculation of enzyme kinetics, use a software that can 
perform nonlinear curve fitting, such as Kaleida Graph 4.1 
(Synergy Software, Reading, PA). 

For FlimPIA with faster kinetics, use a tube-based luminome- 
ter such as NU-2600 equipped with a sample injector and a 
stirrer (Microtech Nichion, Chiba, Japan). 


Optional: For the measurement of LH,-AMP concentration, 
use a fluorescence spectrometer such as FP-8500 (Jasco, Tokyo, 
Japan). 


. Amplify the DNA fragment encoding FLuc derived of Photinus 


pyralis (Ppy) by PCR using pGEX-Ppy [4] or pGEM-luc vector 
as a template, and primers NotG4SBack (with G,S linker and a 
NotI site) and XhoFor (with an XhoI site). 


. Digest the amplified fragment with restriction enzymes NotI 


and Xhol, and subclone into the multicloning site of pET32b 
between Nol and Xhol sites. 


. Prepare a pair of DNA fragments each encoding a binding 


domain (BD) of your interest, which are appended with restric- 
tion sites Neol and Nozl by PCR. Digest the product with the 
same enzymes, and ligate each with the plasmid prepared in 2 
that was digested with Neol and NozI (see Notes 3 and 4). 
Here, the procedure is exemplified with FKBP12 and FRB that 
are denoted as BD1 and BD2, respectively (see Note 5). 


. Transform JM109 competent E. coli cells with the ligation 


mixture. 


. Culture the cells on LBA agar plates and incubate overnight at 


37 °C, 


. Pick a colony and culture it in 4 mL LBA medium and incu- 


bate overnight at 37 °C. 


. Extract the plasmid, and confirm the nucleotide sequence of 


the entire open reading frame. 


. To make H245D/K443A/L530R/E354K mutants as the 


Donor, conduct a series of site-directed mutagenesis with 
QuikChange Site-directed mutagenesis kit as per manufacturer’s 
instructions. Use pET32/BD1/Fluc as the template and primers 
H245D, K443A, L530R, and E354K (Table 1) in combination 
with their complementary strand primers (see Notes 6 and 7). 
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A 


His, G,S H245D E354K K443A His, 
Trx || FKBP12 | | } Fluc ] | | 

His, Gus E354K R437K K529Q His, 
Tx || FRB || Fiuc | | | | 


Fig. 2 Schematic structure of the constructs of BD/Donor and BD/Acceptor. (A) The structure of BD1/Donor. The 
Donor is expressed as a fusion protein with thioredoxin (Trx), which is originally encoded in pET32, and FKBP12 with 
a short linker GGGS (G,S). (B) The structure of BD2/Acceptor. The Acceptor is expressed as a fusion protein with Trx 
and FRB with a G,S linker. Adapted with permission from Fig. S2 in Ref. 2. Copyright John Wiley and Sons 


Table 1 


Nucleotide sequence of the primers 


Sequence (5’-3’) 


NotG4SBack 


XhoFor 
H245D 
K443A 
L530R 
E354K 
K529Q 
FlucEcoBack 


GGCGCGCCGCGGCCGCCGGTGGTGGTGGTAGCATGGAA GACGCCAAA 
AACATAAAG 


GGCGCGCCTCGAGCTTTCCGCCCTTCTTGGCCT 
GTTCCATTCCATGACGGTTTTGGAATGT 
GACCTCTTGAAGTCTTTAATTGCATACAAAGGATATCAGGTGGC 
CCGAAAGGTCTTACCGGTAAACGCGACGCAAGAAAAATC 
TICTGATTACACCCAAGGGGGATGATAAA 
CCGAAAGGTCTTACGGTCRRCTCGACGCAAGAAAATCAGAGAG 
CAATTGCACTGATAATGAA 


FlucR437KFor GATATCCTTTGTATTTAATTAAAGACTTGAKCTTATCAACTATGAAG 


AAGTGTTCGTC 


2. After thermal cycling, add 1 pL of DpuI (~10 U) to the reaction 
mixture to digest the methylated template by incubating for 
1 hat 37 °C. Transform JM109 competent cells with a part of 
the digested template. 

3. Culture the cells on LBA agar plates. 


4. Pick several colonies and culture each colony in 4 mL LBA 
medium. 


5. Extract the plasmid and confirm the correct DNA sequence. 


3.3 Construction 1. To make a K529Q/E354K mutant, perform QuikChange 
of the BD2/Acceptor Site-directed mutagenesis using pET32/BD2/Fluc as a tem- 
Expression Vector plate and primer pairs K529Q and E354K (as in Subheading 3.2, 
(Fig. 2b) step 1) in combination with their complementary strands 


(see Notes 7 and 8). 


3.4 Expression 

and Purification 

of the BD1/Donor 

and the BD2/Acceptor 
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2. Follow the steps 2-5 in Subheading 3.2. 


3. Amplify the DNA fragment using primers FlucEcoBack and 


2 


FlucPacR437K¥ or using the plasmid constructs generated in 
steps 1 and 2 as the template. 


. Digest the plasmid constructs generated in steps 1 and 2 with 


EcoRI and Pacl. 


. Recombine the fragments of steps 3 and 4 using an In-Fusion 


HD cloning kit as per manufacturer’s instructions. 


. Transform £. coli SHuffle T7 Express lysY competent cells 


with pET32/BD1/Donor and pET32/BD2/Acceptor (see 
Note 9). 


. Incubate the transformed cells on a LBA agar plate at 30 °C 


for 24 h. 


. Pick a colony, and culture it in 4 mL LBA medium at 30 °C. 
. Centrifuge 0.4 mL of the resultant cell culture and collect the 


cell pellet (see Note 10). 


. Resuspend the cell pellet with 100 mL of LBA medium in a 


500 mL flask, and culture at 30 °C. 


. When the OD 600 reaches 0.4—0.6, add IPTG to a final concen- 


tration of 40 pM and culture the cells at 16 °C. 


. Centrifuge and collect the cells, and resuspend the cell pellet 


with 10 mL of the TALON extraction buffer. 


. Sonicate the cells on ice using an ultrasonifier for 2 min (50% 


interval) five times (see Note 11). 


. Centrifuge the sonicated solution at 11000 x g for 20 min at 


4 °C. 


. Add 200 pL of TALON Metal Affinity Resin to the superna- 


tant, and rotate or shake gently for 20 min at 25 °C. 


. Centrifuge at 700 x g for 2 min at 4 °C, and resuspend the 


resin with 10 mL of TALON extraction buffer. 


. Rotate or shake gently for 10 min at 4 °C. 

. Repeat steps 11 and 12 two times. 

. Centrifuge at 700 x g for 2 min and collect the resin. 

. Suspend the resin with 3 mL of TALON extraction buffer, and 


transfer the suspension into a TALON 2 mL Disposable 
Gravity Column. 


. Allow to drain until it reaches the top of the resin bed. 
. Wash the column with 2 mL of TALON extraction buffer. 
. Elute the enzyme with 500 pL of TALON elution buffer (see 


Note 12). 
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3.5 Chemical 
Synthesis 

and Purification 

of LH,-AMP 

(See Note 13) [5, 6] 


3.6 LH, Kinetic 
Determination of BD1/ 
Donor and BD2/ 
Acceptor (See Note 19) 


3.7 ATP Kinetic 
Determination 

of BD1/Donor and 
BD2/Acceptor 
(See Note 19) 


1. 


iS) 


13. 
14. 


2. 


Dissolve 15 mg of adenylic acid (AMP) and 5 mg of LH, in 
1 mL of dimethy! sulfoxide. 


. Dissolve 100 mg of dicyclohexylcarbodiimide in 1 mL of 


dimethyl! sulfoxide. 


. Mix the solutions of | and 2. 

. Incubate for 10 min at room temperature (see Note 14). 
. Add 5 mL of acetone to stop the reaction. 

. Centrifuge and carefully remove the supernatant. 

. Add 3 mL of acetone. 

. Centrifuge and carefully remove the supernatant. 

. Repeat steps 7 and 8. 


. Dissolve the precipitate in 1.5 mL of the solution of acetic acid 


(10 mM) and NaCl (40 mM). 


. Apply to the column containing the resin of Toyopearl HW-40S 


(bed volume ~2 mL). 


. Collect the fraction by adding 1 mL of diluted hydrochloric 


acid (pH 4.5) (see Notes 15 and 16). 
Repeat step 12 10-15 times. 
Measure the fluorescence spectrum of the each fraction in 
steps 12 and 13 with excitation wavelength of 327 nm, and 
collect the fractions that emit fluorescence peaking at around 
535 nm (see Note 17) [7]. When the peak values of LH,-AMP 
and «nM of LH, are y and g, respectively, the concentration of 
LH,-AMP can be determined as follows (see Note 18). 

x xy 


The concentration of LH,-AMP (nM) = ————_ 
zx 0.45. 


. Prepare substrate solutions with varied concentrations 


1-2000 uM of LH, in the MOPS buffer containing 20 mM of 
ATP (see Note 20). 


Dispense 50 pL of the solution to a well of 96-well plate. 


3. Measure the bioluminescence intensity after injection of 50 pL 


of 50 nM of the probe (BD1/Donor or BD2/Acceptor) 
diluted in the MOPS buffer with 0.1-s interval for 10 s. 


. Use the maximal light intensity when each concentration of 


LH), was applied and perform nonlinear curve fitting to derive 
Vinx aNd Ky, using Kaleida Graph 4.1. 


. Prepare solutions with varying concentrations (3-2000 pM) 


of ATP in MOPS buffer containing 10 mM of LH, (see 
Note 20). 


. Follow the steps 2—4 in Subheading 3.6. 
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1.5x10* 


RLU / 0.01 sec 


0 0.5 1 1.5 2 
Time after substrate injection / sec 


Fig. 3 Rapamycin dose dependency detected by FlimPIA. The concentration of 
FKBP/Donor and FRB/Acceptor used was 100 nM each. Adapted with permission 
from Fig. 4a in Ref. 2. Copyright John Wiley and Sons 


3.8 LH,-AMP Kinetic 1. Prepare solutions with varying concentrations (2-10,000 nM) 


Determination of BD1/ of LH;-AMP in MOPS buffer. 

Donor and BD2/ 2. Follow the steps 2—4 in Subheading 3.6. 

Acceptor (See Note 21) 

3.9 FlimPIA 1. Suspend equimolar BD1/Donor and BD2/Acceptor (20- 


500 nM) with varying concentrations of rapamycin in the 
MOPS buffer (see Note 3). 

2. Dispense the mixture (50 pL) to an assay tube for a 
luminometer. 

3. Measure the light intensity after injecting 450 pL of 1.1 mM 
ATP, 83 uM of LH;, and 1.1 mM coenzyme A with 10-msec 
interval for few seconds (Fig. 3, see Notes 22-25). 


4 Notes 


1. MgSOg is essential for the adenylation step (Fig. 1a). 


2. Emission intensity and color of light is strongly influenced 
by pH. 


3. A BD should be fused to the N-terminus of the Donor or the 
Acceptor. When the BD pair is fused to the C-terminus of the 
Donor and the Acceptor, a smaller increase in light intensity in 
response to an interaction is observed. Fluc consists of struc- 
turally distinct N- and C-terminal domains. Furthermore, the 
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10. 


ll 


13. 


14. 


15. 
16. 


I? 


C-terminal domain rotates as the reaction proceeds from the 
adenylation to the oxidative luminescent steps. As a result, 
when the BD is fused to the C-terminus, its rotation might be 
hampered. 


. Thioredoxin (Trx), which is originally encoded in the pET32 


vector, is fused to the N-terminus of BD. Trx is used to increase 
the protein solubility in the bacterial expression process. 


. FRB associates with FKBP12 in a rapamycin-dependent man- 


ner [8-10]. Rapamycin is known to act as immunosuppres- 
sant drug, anticancer agent, and accelerating agent for 
autophagy. 


. H245 and K443 are essential for the oxidative reaction of Fluc, 


and the mutations H245D and K443A selectively suppress the 
oxidative activity [5, 11, 12]. The mutation L530R stabilizes 
bound ATP in Fluc to accelerate adenylation activity [13]. E354K 
is a mutation that confers additional thermostability [14]. 


. QuikChange Multi Site-Directed Mutagenesis kit is also 


useful. 


. K529 is a residue that stabilizes phosphates of bound ATP, and 


the K529Q mutation selectively suppresses the adenylation 
activity [15]. The hinge region residue R437 is located close 
to the active site of Fluc in the adenylation conformation. 
The mutation R437K further decreases the distance between 
the residue and LH, and thus suppresses the adenylation 
activity [2]. 


. SHuffle T7 Express lysY is an engineered E. cold B strain in 


which DsbC is constitutively expressed. DsbC works as a chap- 
erone to promote the folding of protein. 


To remove secreted B-lactamase from the precultured E. colz, 
the medium of the preculture should not be added to the main 
culture. 


. From this step the samples must be kept on ice or at 4 °C. 
12. 


The purified enzyme can be stored at —80 °C in the buffer 
containing 15% glycerol for several months. 


The synthesis and use of LH,-AMP is optional. It is used to mea- 
sure LH,-AMP kinetics of BD1/Donor and BD2/Acceptor. 


The color of the solution turns yellow when the reaction is 
completed. 

LH,-AMP is stable in an acidic condition. 

The solution for elution should be cooled because LH,-AMP 
is unstable. 


The peak of the fluorescent spectrum of LH,-AMP is at 
approximately 535 nm, and the peak of the spectrum of LH, is 
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at 529 nm; therefore, the fractions containing LH,-AMP can 
be separated from the fractions containing LH). 


LH,-AMP can be stored at —80 °C for several months. 


The adenylation activity of BD1/Donor should be fully main- 
tained while that of BD2/Acceptor should be suppressed as 
much as possible. Therefore, these kinetics are advised to be 
determined before performing FlimPIA. 


When the kinetics for one substrate are determined, the con- 
centration of the other substrate should be maintained under 
saturating conditions. 


The oxidative luminescent activity of BD1/Donor should be 
suppressed, while that of BD2/Acceptor should be fully main- 
tained. Therefore, these LH,-AMP kinetics are preferably 
determined before performing FlimPIA. 


The maximum S/B ratio is usually observed 1 s after injecting 
substrates. The longer reaction time gives rise to higher back- 
ground luminescence due to the accumulation of LH,-AMP in 
the reaction mixture. 


When the substrate solution is injected into the reaction mix- 
ture, rapid mixing gives reliable result of light emission. 
Therefore, we recommend using a tube-based luminometer 
such as NU-2600 with stirring the mixture at top speed. 
However, a 96-well luminometer also gives a reasonably good 
result. 


Dehydroluciferyl-AMP (L-AMP), which is spontaneously pro- 
duced from LH,-AMP, acts as a competitor of LH,-AMP and 
inhibits light emission. Coenzyme A converts L-AMP to dehy- 
droluciferyl-coenzyme A, which does not act as a competitor 
of LH,-AMP. 

We previously used higher concentration (20 mM) of 
ATP. The ATP-based K,, of the Acceptor is increased due to 
K529Q mutation, while that of the Donor should remain 
unchanged. In the optimized FlimPIA, the final concentration 
of ATP is reduced to 1 mM, so that the residual adenylation 
activity of the Acceptor is selectively suppressed to lower the 
background signal. 


We thank B. Branchini for providing inspiration for the project. 
This project was supported by SENTAN, JST, Japan, and partly by 
Kikkoman Co., Japan, and the “Leave a Nest” Microtech Nichion 
award. 
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Chapter 9 


Quantitative and Dynamic Imaging of ATM Kinase Activity 


Shyam Nyati, Grant Young, Brian Dale Ross, and Alnawaz Rehemtulla 


Abstract 


Ataxia telangiectasia mutated (ATM) is a serine/threonine kinase critical to the cellular DNA-damage 
response, including DNA double-strand breaks (DSBs). ATM activation results in the initiation of a complex 
cascade of events facilitating DNA damage repair, cell cycle checkpoint control, and survival. Traditionally, 
protein kinases have been analyzed in vitro using biochemical methods (kinase assays using purified pro- 
teins or immunological assays) requiring a large number of cells and cell lysis. Genetically encoded biosen- 
sors based on optical molecular imaging such as fluorescence or bioluminescence have been developed to 
enable interrogation of kinase activities in live cells with a high signal to background. We have genetically 
engineered a hybrid protein whose bioluminescent activity is dependent on the ATM-mediated phosphoryla- 
tion of a substrate. The engineered protein consists of the split luciferase-based protein complementation pair 
with a CHK2 (a substrate for ATM kinase activity) target sequence and a phospho-serine /threonine-binding 
domain, FHA2, derived from yeast Rad53. Phosphorylation of the serine residue within the target sequence 
by ATM would lead to its interaction with the phospho-serine-binding domain, thereby preventing comple- 
mentation of the split luciferase pair and loss of reporter activity. Bioluminescence imaging of reporter 
expressing cells in cultured plates or as mouse xenografts provides a quantitative surrogate for ATM kinase 
activity and therefore the cellular DNA damage response in a noninvasive, dynamic fashion. 


Key words ATM, Bioluminescence, Complementation, In vivo, Kinase activity, Live cell, Molecular 
imaging, Reporter, Split-luciferase 


1. Introduction 


Protein kinases constitute one of the largest gene families, com- 
prising ~2% of the human genome. It is estimated that approxi- 
mately 30% of all cellular proteins are phosphorylated on at least 
one residue. Thus, protein kinases have key roles in many funda- 
mental processes of cellular signaling in cancer as well as normal 
cells. Biochemical methods have been widely used to investigate 
whether or not a protein kinase of interest is active. Although bio- 
chemical methods are robust in vitro, they generally do not pro- 
vide information about protein kinase activity in specific subcellular 
compartments; nor do they provide information about activity 
changes at the single-cell level. We and others have developed 
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Table 1 


Luciferase complementation-based reporters for imaging of kinase activities 


Phospho 

peptide-binding 
Kinase Substrate Peptide sequence domain Reference 
AKT FOXO4/AFX1 QSRPRSCTWPLPRPEKKK FHA2 [9] 
ATM CHK2 LETVSTQELYSI FHA2 [12] 
EGFR EPS15 KPANFSAYPSEEDMIE SH2 [2] 
FADD KINASES FADD QNRSGAMSPMSWNSDASTSEAS FHA2 [3] 
GSK3B/CKIa § $B-CATENIN SYLDSGIHSGATTTAPSLSG FHA2 [4] 
c-MET PYK2 LSESCSIESDIYAEIPDETLR SH2 [10] 
TGEBR SMAD2 LTQMGSPSVRCSSMS FHA2 [6] 


optical imaging reporters to measure the kinase activity of various 
oncologically important kinases ([1—-11], Table 1) and have utilized 
these reporters in subsequent studies that lead to the identification 
of new inhibitors and discovery of novel signaling mechanisms 
[12, 13]. 

Bioluminescence is a chemical reaction where light is emitted 
by a living organism. Luciferases are a large family of light- 
generating enzymes that catalyze the oxidation of a substrate, 
generically called luciferin, to yield oxyluciferin with the concomi- 
tant production of light. For in vivo bioluminescence imaging of 
malignancy, tumor cells or cancer-related genes are tagged with a 
reporter gene that encodes a light-generating enzyme, luciferase 
[14-16]. When this reporter is in the presence of the substrate it 
emits a blue to yellow-green light with an emission spectra peaking 
at a wavelength between 490 and 620 nm [14]. An extremely sen- 
sitive cooled charged-coupled device (CCD) camera or a photo- 
multiplier detects any low light that is emitted during the 
bioluminescence reaction. Due to its extreme sensitivity, broad 
dynamic range and exceptionally large signal-to-noise ratio, this 
type of noninvasive imaging permits a real-time analysis of an 
ample amount of various biological events [15]. Although there 
are more than 30 luciferase-luciferin systems that were derived 
independently of each other, the most frequently used luciferase 
for in vivo molecular imaging is the ATP-dependent firefly (Photinus 
pyralis) luciferase [17]. The reason for this is that 30% of the light 
produced by firefly luciferase has an emission spectra above 600 nm, 
a region in which the signal attenuation by the absorbing and scat- 
tering properties of live mammalian tissue is at a minimum [15, 17]. 
Recently, a very bright and smaller luciferase (NanoLuc; NLuc) 
from deep sea shrimp ( Oplophorus gracilirostris) has been successfully 
used for dual luciferase imaging in a mouse model [18]. 
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A significant advantage of cell-based bioluminescent kinase 
reporter is its adaptability for high-throughput screening. 
Bioluminescence generated in luciferase assays offers higher sensi- 
tivity than FRET-based systems due to amplification of the signal. 
In addition, luciferase is less susceptible to inference from nonspe- 
cific fluorescence of compounds. Thus, bioluminescence-based 
assays are highly suited for high-throughput screening. Furthermore, 
luciferase activity can be monitored dynamically and noninvasively, 
allowing bioluminescence-based cell assays to provide a unique 
method for identifying specific compounds that interact with the 
target in the correct cellular compartment and under normal cellular 
physiological conditions of that compartment (pH, concentrations 
of specific ions, etc). Reporters wherein the firefly luciferase enzyme 
has been divided into two halves (N-Luc and C-Luc) were originally 
developed to study protein-protein interaction [19]. These split- 
luciferase reporters were based on either the inter-molecular or 
intra-molecular complementation of the luciferase fragments to 
generate signal in response to cellular cues. 

Ataxia Telangiectasia Mutated (ATM) is a member of the PI3- 
like family of serine/threonine kinases. It is a very large 370 KDa 
protein encoded by human chromosome 11q22-23. It plays a criti- 
cal role in repair of DNA double-stranded breaks (DSBs) thereby 
maintaining genomic stability. These processes include, but are not 
limited to, DNA replication, DNA repair, cell cycle progression, 
apoptosis, and senescence. ATM exists in its inactive form as a non- 
covalently linked dimer where the kinase domain of one monomer 
is bound to the internal domain of another monomer covering the 
$1981 residue. In response to DSBs, the kinase domain of one 
monomer phosphorylates $1981 of the other interacting ATM 
resulting in subunit dissociation, ATM activation, and recruitment 
to DNA break sites [20]. Ionizing radiation-induced ATM activa- 
tion results in the activation of a large number of ATM substrates 
[21-26] including P53, MDM2, SMC1, KAP1, BRCAI, yH2AX, 
and CHK2. The activated ATM triggers a sequence of events 
including cell cycle arrest, allowing time for the repair of the dam- 
aged DNA in syne with circadian rhythm [27]. If damaged DNA is 
left unrepaired it can lead to cell death, genomic instability, cancer, 
and/or other pathologies [28]. The 2015 award of the Nobel 
Prize in Chemistry for the discovery of DNA repair mechanisms 
highlights the importance of this pathway. Because of the impor- 
tant role ATM plays in cancer, therapeutics have been devised to 
target it [29]. 

In vitro kinase assays using purified substrate and kinase are 
routinely used to evaluate kinase activity. For traditional cell-based 
studies, immunohistological and biochemical techniques have 
been utilized for evaluating the kinase activity of ATM, such as 
counting pATM foci, yH2AX foci, immunofluorescence, or 
immunoprecipitation-western blotting [26, 30, 31]. Johnson, 
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2 Materials 


2.1 Molecular 
Biology 


2.2 Cell Culture 


You, and Hunter [11] described a fluorescence resonance energy 
transfer (FRET )-based biosensor for monitoring ATM kinase activ- 
ity in live cells. Although this reporter provides direct measure- 
ment of the ATM kinase activity, it has a limitation of usability in 
mouse model due to tissue penetration and autofluorescence in the 
CFP-YFP range. In this book chapter, we provide detailed meth- 
ods of use for the recently developed split firefly-based biolumines- 
cence reporter to noninvasively, dynamically, and sensitively 
measure ATM kinase activity in live cells and mouse models [7]. 


0 ON BD 


. DNA encoding open reading frame for phospho-protein- 


binding domain (e.g., Rad53p FHA2 domain). 


. Full-length coding sequence for the Firefly luciferase (FLuc) or 


plasmids coding for the N-terminal luciferase (amino acids 
1-416) and C-terminal luciferase (amino acids 398-550) 
fragments. 


. Expression vectors with constitutive promoters for expression 


in mammalian cells (e.g., pEF vector from Clontech). 


. Optional: Expression vectors and packaging plasmids for 


generating lentiviral particles. 


. High fidelity DNA polymerases (e.g., Pfu DNA polymerase), 


dNTPs, oligonucleotide primers for cloning and sequencing, 
PCR reaction buffers, PCR Thermocycler, restriction endonu- 
clease, DNA ligase, site-directed mutagenesis kits, high efficiency 
competent cells, antibiotic, bacterial growth media (LB, SOC), 
agar plates, plasmid DNA extraction kits, DNA gel purification, 
and sequencing kits. 


. HEK293T cells or other readily transfectable cell lines. 


. Desired cell line(s) for biologic question of interest (1.e., 


D54, U87). 


. Fetal bovine serum (FBS). 
. Complete growth medium with FBS: Growth media with 10% 


FBS, and 1% penicillin/streptomycin. 


. Serum-free medium: Growth media without FBS and penicillin/ 


streptomycin. 


. Trypsinization medium: 0.05% Trypsin-EDTA. 

. 1000x penicillin stock solutions: 10,000 Units/mL penicillin. 
. 1000x streptomycin stock solution: 10 mg/mL streptomycin. 
. 1000x geneticin/G418 stock solution: 50 mg/mL geneticin/ 


G418. 


2.3 Cell Imaging 


2.4 Animal Imaging 


10. 


ll 


1. 
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Sterile 1x phosphate-buffered saline (PBS) solution. 


. Transfection reagents. 
12. 


Filter paper discs and forceps for isolation of stable clones. 


Black-walled or white-walled 96-well clear-bottom plates for 
tissue culture. 


2. Sterile low adherence pipette tips with barrier filter. 


3. Firefly luciferase substrate: 4 mg/mL pb-luciferin 40x stock 


COND OB 


10. 


11. 


1. 


solution in PBS stored in dark colored vials at —80 °C. 
Alternatively, GloSensor c-AMP reagent (Promega) can be 
used to monitor firefly luciferase activity. 


. ATM inhibitor: 1000x stock solution 3 M caffeine. 

. ATM inhibitor: 1000x stock solution 20 mM KU-60019. 
. ATM inhibitor: 1000x stock solution 20 mM KU-55933. 
. ATM inhibitor: 1000x stock solution 20 mM CGK733. 


. Live cell bioluminescence imaging system with very high 


sensitivity and required software package for data generation 
and analysis (IVIS, Envision Xcite multi-label Plate Readers 
from Perkin Elmer; or similar system). 


. Liquid handling instrument (Biomek Nx from Beckman or 


similar). 
Plate handling robot (Plate Handler I] robot from Perkin 
Elmer or similar). 


Optional: Cell culture incubator compatible with high- 
throughput instruments (LiCONiC StoreX STX44 IC preci- 
sion incubator from Liconic Instruments or similar) for 
high-throughput assays. 


Appropriate mouse strain for desired experimental system such 
as immunocompromised mice (nude, SCID, or NSG) for 
human tumor xenografts. 


2. Optional: Small animal shaver such as Wahl trimmer. 


3. 40 mg/mL luciferin stock in PBS, store in tightly sealed dark 


tubes at —20 or —80 °C. 


. 28-30 gauge insulin syringe for intra-peritoneal (IP) luciferin 


injection in mice. 


. Bioluminescence imaging instrument (IVIS or similar instrument) 


with a heated platform and isoflurane anesthesia injection and 
controller systems. 


. Additional accessories such as nose cones, animal partitions, 


black paper sheets, ear tags, markers, 70% alcohol, and 10% 
bleach or similar solution for disinfecting bench surfaces. 
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3 Methods 


3.1 Construct Firefly 
Luciferase 
Complementation- 
Based ATM kinase 
Activity Reporter 


1. Select a substrate such as CHK2 and determine the length of 
the substrate sequence that can be used for the construction of 
the reporter (see Note 1). We typically select 12—20 amino acid 
long substrate sequences with the target residue/s at the center 
of the sequence where possible (Table 1). For the construction 
of the ATM kinase reporter (ATMR), we selected a 12-residue 
sequence derived from CHK2 (Figs. 1 and 2). 


2. Add a 5-7 amino acid long linker sequence at both the ends of 


the substrate sequence. We typically use GGSGG as the linker 


in our kinase reporters. For Ser/Thr kinases, attach a phospho- 
peptide-binding domain such as FHA2 (residues 420-582) 
[32]. For Tyr kinases, attach a SH2 domain (residue 374-465 
of mouse shc2 [33, 34]). Use appropriate N-terminal (N-Luc) 


and C-terminal (C-Luc) firefly luciferase fragment pairs [19] at 


the flanks. 
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Fig. 1 The DNA-coding sequence and translated amino acid sequence for all the domains of the ATM kinase 
activity reporter. In frame short linker sequences (linker) inserted between each functional domain provide 
flexibility for the intramolecular domain interaction in the chimeric reporter molecule. N-Luc denotes amino 
acids 1-416 of firefly luciferase and C-Luc denotes amino acids 398-550. The target peptide sequence was 
derived from the CHK2 coding sequence (amino acids 63-74). The Ser/Thr phospho-peptide-binding domain 


(FHA2) comprises amino acids 420-522 of Rad53P protein 
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Fig. 2 The components and functional basis of the ATM kinase activity reporter. 
(a) The ATM reporter consists of a phospho Ser/Thr-binding domain (FHA2), sub- 
strate peptide (CHK2), and split firefly luciferase. The substrate sequence is 
flanked by short linker sequences at either end. The functional basis of the 
reporter is demonstrated in part figure (b). In the presence of ATM kinase activity, 
the CHK2 target peptide is phosphorylated, resulting in interaction with the FHA2 
domain, producing stearic constraints that inhibit functional reconstitution of the 
luciferase. In the absence of ATM kinase activity such as by small molecule 
inhibitors, sIRNA-mediated knockdown (of ATM), or over-expression of phospha- 
tases, the CHK2 consensus sequence is hypo-phosphorylated, allowing for lucif- 
erase enzyme reconstitution and increased bioluminescent activity 


3. Generate chimeric reporter constructs using appropriate 
molecular biology procedures. We generally incorporate linkers, 
substrates, and restriction enzyme sites in primers. We typically 
confirm the clones by sequencing, construct expression by 
Western blotting and functional bioluminescence assays before 
proceeding to generate stable cell lines. 


4. Generate relevant control constructs wherein the phospho- 
target residue/s within the substrate is mutated to neutral 
amino acids. We typically mutate Ser/Thr or Tyr to Ala in 
mutant reporters by site-directed mutagenesis (Fig. 2a). 


5. Express complementation reporters in appropriate vectors for 
mammalian cells. Vectors should be selected with markers, such 
as antibiotic resistance genes or co-expressed fluorescent proteins 
that are suitable for generating stable cell lines. We typically use 
pEF vector in combination with the geneticin/G418 antibiotic 
for stable clone selection. 
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3.2 Cell-Based 
Bioluminescence 
Imaging of ATM 
Kinase Activity 


. We typically do all of our cell-based and in vivo bioluminescence 


assays using the reporter expressing stable cell lines. We care- 
fully select cell lines that represent an appropriate cellular and 
biological context for our studies (see Note 2). Cells are trans- 
fected with the reporter plasmids and allowed to grow under 
the antibiotic selection media. We typically pick 12-24 single- 
cell clones using sterilized filter paper discs and choose the best 
clones by measuring bioluminescence in response to specific 
kinase inhibitors (see Notes 3 and 4). 


. The three best reporter expressing stable cell lines (clones) 


are expanded and frozen at low passages for future use. 
Cells are maintained in 10 cm dishes with complete growth 
media containing serum and the appropriate amount of G418 
(see Note 5). 


. Stable cell lines are plated overnight in black-walled or white- 


walled, clear-bottom 96-well plates for live cell assays. Cell 
density should be 2500-10,000 cells per well in 100 pL com- 
plete growth medium with serum (see Notes 6 and 7). 


. Cell culture media is removed and ATMR expressing cells are 


treated with different concentrations of ATM inhibitors such 
as caffeine, KU-60019, KU-55933, or CGK733 in serum-free 
media (100 pL per well). 


. After 5 min, 2.5 pL p-luciferin (black-walled plates) or cAMP- 


Glo reagent (white-walled plates) is added by multichannel 
pipette into each well for a working concentration of 100 pg/mL 
of firefly luciferase substrate. 


. Black-walled 96-well plates are imaged on the IVIS imaging 


system as soon as possible after adding luciferin. Typically, 
bioluminescence is acquired for 30-60 s at medium binning. 
For a time-course, the images are acquired with 3-10 min 
delay between the reads (Fig. 3a, b). 


. The white-walled plates are read on the Envision system right 


after the addition of the substrate. Generally, each well of the 
plate is read for 0.01-1.0 s. For a longer time-course activity 
measurement, a delay of 15-60 min between each read is set 
(Fig. 3c). For each read, the robot takes the plate out from the 
incubator, loads it on the reader where the plate is read, and is 
transferred back to the incubator until the next time point 
(see Note 8). 


. Quantify bioluminescence acquired on IVIS system by region- 


of-interest (ROT) analysis using Living Image software. The 
bioluminescence data from Envision system is automatically 
saved in quantitative form in tab-delimited file format. 


. Since radiation activates ATM within minutes, the biolumines- 


cence activity of ATMR can be evaluated within 15 min after 
irradiation. 
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Fig. 3 Firefly complementation-based live cell assay for noninvasive monitoring of ATM kinase activity. (a) D54 
Cells stably expressing the ATM kinase activity reporter (ATMR) were plated in black-walled 96-well plates 
(5000 cells/well). Cells were incubated with mock (DMSO) or an increasing concentration of ATM kinase inhibi- 
tor KU-55933, which increased the signal in a dose-dependent way. (b) A representative image of the biolumi- 
nescence acquired in response to various concentrations of KU-55933 is shown. ROI in a grid is created and 
overlaid in the pseudocolored image to quantitate the photons emitted. Scale bar shows photon flux in pseu- 
docolor with b/ue as the lowest and red as the highest counts. (¢) D54-ATMR cells plated in white-walled 
96-well plates and treated as described above and read on Envision plate reader 


10. 


3.3 In Vivo Imaging 1. 


of ATM Kinase Activity 


All the bioluminescence measurements should be validated by 
Western blotting in ATMR expressing cell lines in parallel 
experiments. 


D54-ATMR cells are expanded, trypsinized, and suspended in 
serum-free media at 40 x 10° cells/mL. 50 pL of this suspen- 
sion is injected into each flank (2 x 10° cells) in nude mice 
using a 22-gauge needle. We usually wait until the tumor 
reaches 60-100 mm size (3-4 weeks) before starting the 
experiments. 


. We acquire baseline bioluminescence measurements 3-6 h 


before starting the treatment (Fig. 4a). Each mouse is injected 
with 100 pL p-luciferin (4 mg/mL stock prepared in sterile 
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Fig. 4 In vivo measurement of ATM kinase activity in mouse tumor xenograft model. (a) CD-1 nude mice harboring 
D54-ATMR WT tumor xenografts were injected with luciferin and bioluminescence was acquired as described 
3 h before treatments. (b) The animals were injected with KU-55933 (25 mg/kg) or vehicle control (DMSO) and 
bioluminescence was acquired 1, 4, 8, and 24 h posttreatment. The ATM reporter fold activation upon ATM 
inhibition is plotted over mock treatment. (c) Similarly, mouse harboring D54-ATMR WT tumor xenografts 
were whole body irradiated with 5Gy of radiation or sham irradiated and bioluminescence was measured for 
up to 24 h. About 70% decrease in the reporter activity was observed 8 h post irradiation 


PBS; 400 ppg per mouse) anesthetized with 1—2% isoflurane for 
5 min (see Note 9). 


3. Transfer mice to the bioluminescence instrument, where they 
are maintained under anesthesia, and acquire bioluminescence. 
We typically acquire data on five mice at once isolated by a 
plastic separator. Generally, a 15-30 s acquisition at medium 
sensitivity is sufficient. We typically acquire data for 10-20 
reads with a 1-5 min delay between the reads to cover the bio- 
luminescence peak from all the tumors in each of the mice. 


4. Treat the mice with appropriate inhibitors such as KU-55933 
(both 25 mg/kg), or activators such as radiation (5 Gy) and 
monitor bioluminescence over time. Vehicle control (DMSO) 
or sham-irradiated mice should be used as control (Figs. 3c 
and 4b; see Note 10). 


3.4 Conclusions 


4 Notes 
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5. Remove mouse from imaging instrument and monitor for 
complete recovery from anesthesia. 


6. Quantify imaging data by region-of-interest (ROT) analysis of 
bioluminescence produced by the tumor, using units of photon 
flux (Fig. 4a; see Notes 11-17). 


The method described herein is an adaptation of the traditional 
protein complementation assay for the detection of protein-protein 
interaction in live cells. Instead of monitoring the interaction of 
two proteins through the use of split reporter molecules, we have 
adapted the assay such that the interaction between the “bait” and 
the “prey” occurs in response to the activity of a specific kinase. 
The kinase can be a serine/threonine- or a tyrosine-kinase. The 
reporter has also been engineered such that increased complemen- 
tation (and therefore reporter activity) occurs in response to 
decreased kinase activity. This approach is therefore very well suited 
for high-throughput screens for kinase inhibitor libraries since a 
positive hit would be detected as an increase in bioluminescence 
activity, thereby less likely to result in false positives. We have also 
used analogous reporters for whole genome siRNA screens. As an 
example, a reporter for TGF-f receptor serine/threonione kinase 
activity was used in a human kinome siRNA screen to yield a num- 
ber of novel genes as regulators of the TGF-B receptor function 
[12]. Regulation of the molecular events that lead to the activation 
and/or inactivation of the ATM kinase activity is yet to be defined; 
therefore, it is anticipated that analogous whole genome siRNA 
screens against the ATMR will most likely yield new insights into 
the role of novel genes in the regulation of the cellular response to 
DNA damage. 


1. Substrate sequence for the construction of the reporter should 
be decided based on literature searches. It is imperative to have 
good antibodies available to detect changes in the substrate 
phosphorylation as this will help in validating the kinase 
reporter bioluminescence data by biochemical techniques. 


2. The selection of a specific cell line for creating a reporter- 
expressing stable line should be based on the specific questions 
being interrogated and the intrinsic activity and detectability of 
the kinase and the substrate in the cell line. This should be 
determined by experimentation as well as literature searches. 
We selected the D54 glioblastoma cell line because ATM inhi- 
bition in glioblastoma may sensitize them to IR and chemo- 
therapy [35]. D54 cells express wild-type ATM protein and 
responds to ATM inhibitors and irradiation as seen by Western 
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immunoblotting using antibodies against ATM and its substrate 
CHEK2. 


3. We generally select for clones that exhibit low to moderate 


bioluminescence after adding luciferin. Since it is an activatable 
reporter that shows an increase in the light with inhibition of 
the ATM kinase, clones expressing the reporter in very high 
abundance may not yield high signal to background and thus 
may show a limited fold activation in response to inhibitors. 


. For picking up the clones, complete growth media with antibi- 


otic is aspirated from the tissue culture dishes containing single- 
cell clones. Sterilized filter paper discs pre-wetted in trypsin are 
transferred to clones using sterilized forceps in a laminar flow 
hood. After 4-5 min, the discs are lifted from the tissue culture 
plate and swirled around in a 24-well plate containing complete 
growth media with half the concentration of antibiotic used for 
clone selection. Generally, 12-24 single-cell clones are picked. 
Forceps are sterilized either by heat or by dipping them in 
ethanol for 5-10 min. Make sure to let the ethanol evaporate 
before using the forceps to pick up the clones. 


. For expanding and maintaining of stable cell lines, we use half 


the concentration of G418 than that was used for selection. 
The G418 concentration for stable clone selection should be 
empirically decided. 


. The N-Luc and C-Luc fragments used in the construction of the 


complementation-based ATM reporter are derived from a firefly 
luciferase that has been optimized to work in mammalian cells 
at the physiological temperature of 37 °C. Therefore, all the 
bioluminescence acquisitions should be performed at 37 °C. 


. Since this reporter is based on the complementation of the 


light-generating enzyme luciferase, it works only in live cells 
under physiological conditions. Methods wherein cell lysate is 
used to measure the bioluminescence signals are incompatible 
with this reporter system and will not be able to yield any 
detectable change in signal to background (under different 
treatment conditions). 


. The Envision system is built with ultra-sensitive luminescence 


detection technology; thus 0.01—0.1 s measurement for each 
well is usually sufficient. The detection time can be increased if 
the signal is very weak and desired signal-to-noise is not reached 
with lower detection times. Furthermore, the bioluminescence 
signal from cAMP-Glo reagent is stable for a prolonged period 
of time, allowing us to measure the kinase activity of ATM for 
longer periods. 


. In our hands, we find that 400 pg luciferin/mouse gives us the 


best signal-to-background bioluminescence readings for split 
firefly luciferase-based kinase reporters. For smaller tumors or 


10. 


11. 


12. 


13. 


14. 


15, 
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tumors generated from cell lines expressing very low levels of 
reporter, the amount of luciferin can be increased to 150 mg/kg 
body weight (i.e., 3.5-4 mg/mouse with 20-25 g average 
mouse weight). 


One may need to empirically determine the optimal inhibitor 
concentration for the best signal-to-noise bioluminescence 
detection in vivo. We usually test two to three different con- 
centrations of the drugs in mouse tumor xenograft model to 
find the optimal concentration that gives highest fold change 
over vehicle control without being toxic to animals. 


We create separate region-of-interest (ROI) for each tumor 
based on its size and shape. We also make sure that this ROI 
does not overlap with the ROI of any other tumors. We copy- 
paste the same ROI for each tumor for counting total photon 
flux for all the time points. We may move the position of the 
ROI so that it covers the tumor (because the position of the 
same animal with the same tumor might be slightly different 
between different time points) but do not change the overall 
shape or size of any ROI. This removes the chances of includ- 
ing background photon counts emanating from mouse skin. 


We generally use four to five mice in an experimental group for 
bioluminescence data acquisition on the IVIS Spectrum sys- 
tem. Based on the number of animals in a group, we choose a 
stage level (distance of the CCD camera and the subject) and 
use the same stage level for the whole experiment. Changing 
the distance between the CCD camera and the subject height 
between different time points or reads would lead to different 
photon counts which will be difficult to analyze. 


We usually use medium sensitivity settings with 15-60 s data 
collection time for each read. One should acquire photon flux 
without any saturated pixels. All the saturated pixels (above 108 
counts) show the same value, and therefore, cannot be accu- 
rately analyzed. 


We typically use the maximum photon flux emitted from each 
tumor separately for all the calculations. Each tumor may show 
the maximum emittance at different periods of time; therefore, 
we usually perform sequential reads with a delay between each 
read which allows us to collect the maximum photon flux for 
all the tumors. 


We typically use the maximum photon count for each tumor 
for each time for the analysis. Ifa tumor does not respond at any 
time-point, we omit that reading from the analysis. We analyze 
and plot combined values of all the tumors from one treatment 
group. Photon counts from vehicle (DMSO) or sham treated 
animals are set as onefold and the ratios of other treatments are 
counted as fold change from vehicle/sham treatment. 
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16. Due to the sensitivity of the imaging system, one typically gets 
some background bioluminescence in nude and SCID mice. It is 
important to shave and NAIR® the mice if using a mouse strain 
with hair to reduce the background and determine the correct 
size, shape, and position of a ROT. 


17. 


Bioluminescence data acquired in a mouse xenograft model 


should be validated by biochemical methods such as Western 
blotting or immunohistochemistry (IHC). For validation of 
the bioluminescence data for ATMR, tumor tissue should be 
analyzed with pATM and pCHk2 antibodies after control, 
KU-55933, KU-60019, or radiation treatments. 
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Part IV 


6-lactamase Sensors 


Chapter 10 


Creation of Antigen-Dependent (-Lactamase Fusion 
Protein Tethered by Circularly Permuted Antibody 
Variable Domains 


Hiroto Iwai, Miki Kojima-Misaizu, Jinhua Dong, and Hiroshi Ueda 


Abstract 


Antibody-based molecular switches that are able to recognize a range of exogenous antigens can be highly 
useful as a versatile biosensor. However, regulating the catalytic activity of enzymes by antibodies is still 
hard to achieve. Here, we describe a design method of unique antibody variable region Fv introduced with 
two circular permutations, called Clampbody. By tethering the Clampbody to a circularly permuted 
TEM-1 f-lactamase (BLA), we successfully constructed a genetically encoded molecular switch Cbody-cpBLA 
that shows antigen-dependent catalytic activity. 


Key words TEM-1 f-lactamase, Antibody variable region, Circular permutation, Allosteric regulation, 
Immunosensor 


1. Introduction 


Regulating the catalytic activity of enzymes by exogenous sub- 
stances is potentially useful for the detection of biomarkers, envi- 
ronmental pollutants, and for enzyme-based prodrug therapies 
[1, 2]. To create the molecular switches that can recognize a wide 
variety of molecules, methods to control the catalytic activity of an 
enzyme by means of antibody domains are considered highly 
important [3]. 

To date, there have been a number of fusion proteins of anti- 
body domains and enzymes whose catalytic activity was regulated 
by the binding antigen-dependent binding to their cognate anti- 
body [4, 5]. Recently, we reported a fusion protein consisting of a 
circularly permuted TEM-1 f-lactamase (cpBLA) and Fv, called 
Fy-cpBLA [6]. This protein switch was made on a cpBLA that had 
been successfully used to make a maltose-dependent switch when 
fused with a circularly permuted maltose- binding protein (cpMBP) 
[7-9]. Fv-cpBLA showed antigen-dependent catalytic activation 
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and antigen-dependent growth-enhancement of E. coli expressing 
the fusion protein in the presence of ampicillin. However, the 
increase of the specific activity was only up to 25% and there was a 
room for improvement. 

More recently, aiming to transduce the antigen-dependent 
binding signal more effectively to the catalytic domain, we tried to 
optimize Fy-cpBLA by recombining the antibody domains to 
cpBLA more directly instead of using commonly employed flexible 
linkers [10]. The distance between the N- and C-termini of anti- 
body variable domain Vy and V, is between 30 and 40 A. This is 
considered too long and may disrupt the optimal orientation of 
protein fragments or the conformation of circularly permuted 
enzymes. Therefore, based on the previous report [11], we focused 
on the 3 and 3b loops located far from antigen-binding loops but 
near the domain interface between Vy and V, (Fig. la). Thus, we 


Fig. 1 Creation of Clampbody and its fusion to cpBLA. (a) Schematic structure of single chain Clampbody (sc- 
Chody). cpV, and cpV, chains are shown in magenta and cyan, respectively. The (G,S)3 linkers connecting their 
native N- and C-termini are drawn as dashed lines. Two cysteine residues were inserted at the N- and C-termini 
of cpV, to promote correct folding via disulfide linkage. The SG, linker connecting cpV, and cpV, is drawn as 
dotted line. (b) Schematic structure of Cbody-cpBLA, in which the structure of cpBLA (PDB code 4DXB) is 
shown in green. (c, d) Expression and purification of Cbody-cpBLA. (c) CBB-stained SDS — PAGE. (7) MW 
marker, (2) concentrated culture supernatant, (3) soluble fraction from the cell lysate, (4) insoluble fraction 
from the same, (5) solubilized insoluble protein, (6) purified and refolded protein. (d) Western blot to detect the 
His tag. The lanes are the same as in (c). The bands for Cbody-cpBLA are indicated by arrows. Reproduced 
from Iwai et al. [10] with permission from American Chemical Society 


2 Materials 


2.1. Construction 
of Clampbody Genes 
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designed a novel antibody Fv domain format whose termini were 
located at these sites, and named this doubly circularly permuted 
antibody Fv as Clampbody. In addition, we also designed a fusion 
protein comprising Clampbody and cpBLA by connecting cpVy 
and cpV; to the N- and C-termini of cpBLA [7] with a minimum 
number of linker residues between them (Clampbody-cpBLA, 
Cbody-cpBLA) (Fig. 1b). 

We demonstrated the specific antigen binding of Clampbody, 
and more importantly, the antigen-dependent catalytic activity of 
novel molecular switch Cbody-cpBLA. Interestingly, we also found 
that the antigen-dependency of Cbody-cpBLA became more sig- 
nificant in reaction buffers containing some denaturant or deter- 
gent such as urea and Triton X-100. From these results, we 
reasoned that Cbody-cpBLA was stabilized by the binding of the 
antigen peptide, and therefore showed antigen-dependent catalytic 
activity. This would be a good example of the allosteric regulation 
driven by ligand-induced stabilization or thermodynamic shift, 
which is collectively called “ensemble” model [12-15]. By opti- 
mizing the reaction condition, Cbody-cpBLA could detect their 
antigen peptide specifically at single nM level. Considering the 
variety of available substrates, Cbody-cpBLA would be applied to 
biosensors, prodrug therapies, and possibly to bacterial growth 
regulation. We also propose Clampbody as a useful tool in antibody- 
based sensing proteins such as protein complementation assay 
[16], fluorescence resonance energy transfer-based assay [17, 18], 
and transduction assay of reaction intermediate [19, 20]. Taking 
advantage of the proximity of their termini, Clampbody might be 
fused with many protein domains and fragments, and therefore be 
able to control the distance and orientation of fused protein(s) in 
an antigen-dependent manner. 


Prepare all solutions using ultrapure water (made by purifying 
deionized water to attain resistivity of 18 MQ cm at 25 °C). For all 
the reagents, use them of the highest grade available. The chemi- 
cals and other reagents, unless otherwise indicated, were obtained 
from Sigma (St. Lois, MO), Nacalai Tesque (Kyoto, Japan), or 
Wako Pure Chemical Industries (Osaka, Japan). 


1. Oligonucleotides (see Table 1) (e.g., Greiner Japan, Tokyo, 
Japan; Eurofins genomics, Tokyo, Japan). 

2. High-fidelity thermostable DNA polymerase (e.g., KOD FX, 
Toyobo, Osaka, Japan; PrimeSTAR® Max DNA Polymerase, 
Takara-Bio, Shiga, Japan). 
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2.2 Protein 
Expression 


2.3 Refolding 
of Clampbodies 


2.4 Western Blot 
of Clampbodies 
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. DNA purification kits (e.g., Wizard® SV Gel and PCR 


Clean-Up System, Promega, WI). 


. Restriction enzymes (e.g., New England Biolabs, Beverly, MA). 
. DNA ligation kit (e.g., Ligation high ver. 2, Toyobo). 

. BGP C-terminal peptides (e.g., Genscript, Piscataway, NJ). 

. pET26/Fv-cpBLA (previously described [6]). 

. pIT2(13CG2) (previously described [21]). 

. pET30b(+) (Novagen, Darmstadt, Germany). 

. pCR4Blunt-TOPO (Zero Blunt TOPO PCR Cloning Kit for 


Sequencing, Thermo Fisher Scientific, Waltham, MA). 


. In-Fusion® HD cloning kit (Clontech, CA). 
. Tris-EDTA buffer: 10 mM Tris-HCl, 1 mM EDTA, pH 7.4. 


. SHuffle T7 express lysY competent cells (New England Biolabs). 
. LB medium: 1% tryptone, 0.5% yeast extract, 0.5% NaCl, pH 7.0. 
. LBK medium: LB containing antibiotics (50 g/mL kanamycin). 


Agar. 


. 1 M Isopropyl-B-d-thiogalactopyranoside (IPTG). 

. Supersonic homogenizer (e.g., VP-15S Taitec, Saitama, Japan). 
. 2-mercaptoethanol. 

. 0.5 M EDTA, pH 8.0. 

. Guanidium hydrochloride (Gdn-HCl). 

. TALON extraction buffer: 300 mM NaCl, 50 mM sodium 


phosphate, adjusted to pH 7.0. 


. TALON metal affinity resin and TALON disposable gravity 


column (Clontech, CA). 


TALON elution buffer: TALON extraction buffer supplement 
with 200 mM imidazole. 


Pierce™ BCA protein assay kit for determining protein con- 
centrations (Thermo Fisher Scientific, Waltham, MA). 


. Slide-A-Lyzer Dialysis Cassettes, 3.5 K MWCO, 3 mL 


(Thermo Fisher Scientific). 


. Dialysis buffer: 50 mM Tris-HCl, 200 mM NaCl, 1 mM 


EDTA, pH 8.0. 


SDS-PAGE gradient gels (5 ~ 20%) (e.g., e- PAGEL, ATTO, 
Tokyo, Japan). 


. SDS running buffer: 25 mM Tris-HCl, 192 mM 1-glycine, 


1 g/L sodium dodecyl sulfate. 


. PVDF membrane (e.g., Clear Blot Membrane —P plus, ATTO). 
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2.5 ELISA 
for Clampbodies 


2.6 Enzyme Assays 
for Clampbodies 


4. Transfer buffer: 25 mM Tris-HCl, 192 mM I-glycine. 


. Tris buffered saline (TBS): 50 mM Tris-HCl, 150 mM NaCl, 


pH 7.4. 


. TBS with Tween (TBST): TBS supplemented with 0.05% of 


Tween 20. 


. TBST with skim milk (TBST-S): TBS-T supplemented with 5% 


skim milk. 


. HRP-conjugated anti-His, antibody (e.g., Roche Diagnostics 


Japan, Tokyo, Japan). 


. Amersham ECL Prime (GE Healthcare, UK). 
10. 


Chemiluminescence imager (e.g., LAS-4000 = mini 
Luminoimager, Fujifilm, Tokyo, Japan). 


1. Transparent 96-well microplates (e.g., Greiner 655,061). 


10. 


. 10 pg/mL streptavidin (e.g., type Hl, Wako, Osaka, Japan). 
. Phosphate buffered saline (PBS): 1.47 mM KH,POg, 8.1 mM 


Na,HPO,, 2.7 mM KCl, 137 mM NaCl, pH 7.4. 


. PBST: PBS containing 0.1% Tween 20. 

. Blocking solution (e.g., Immunoblock, DS Pharma, Osaka, Japan). 
. Ll pg/mL N-terminally biotinylated BGP-C11 peptide in PBST. 
. TMBZ reaction buffer: 100 pg/mL TMBZ (3,3’,5,5’-tetra- 


methylbenzidine), 0.006% H,O., 100 mM NaOAc, pH 6.0. 


. 10% H,SO, to stop the reaction. 


. Black 96-well microplates (e.g., Corning Costar 3693). 

. Transparent 96-well microplates (e.g., Greiner 655061). 

. Dimethyl sulfoxide (DMSO). 

. Fluorocillin™ Green 495/525 B-Lactamase Substrate, soluble 


product (Thermo Fisher Scientific, Waltham, MA). 1 mM 
stock solution in DMSO can be stored for several months 
at —30 °C. 


. Nitrocefin (e.g., Calbiochem, Darmstadt, Germany). 20 mM 


stock solution in DMSO can be stored for several months 
at —30 °C. 


. PBS containing 1% Triton X-100. 
. PBS containing 3 M urea. 
. PBS containing 20 mg/mL bovine serum albumin (BSA) 


(e.g., Sigma-Aldrich, St. Lois, MO). 


. A fluorescence microplate reader (e.g., GENios Pro Tecan, San 


Jose, CA). 


A microplate reader (e.g., SH-1000, Corona Electric, Hitachi, 
Ibaraki, Japan). 


3 Methods 


3.1 Construction 

of the V,, and V, genes 
for Anti-BGP 
C-Terminal Peptide 
Antibody 


3.2 Construction 
of the cpV, Gene 
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To create a Clampbody as an antigen-dependent switch, any anti- 
body for small molecule antigen, whose variable region 
(Fv = Vy + V;) is stabilized by the bound antigen, will suffice. 
Here, we employ the Vy and V;, domains of antibody KTM219 
[22], which recognizes the C-terminal peptide of the human bone 
Gla protein (BGP) as a model. First, clone the Vy and V;, genes by 
the following procedure. 


1. Amplify the DNA fragment encoding Vy fragment of the anti- 
body by PCR using the primers NcoVHrev and VHSpefor, 
pET26/Fv-cpBLA as a template, and KOD FX DNA poly- 
merase and the DNA fragment encoding V, fragment of the 
antibody by PCR with AgeVLrev and VLNotfor as primers. 


2. Purify the resulting DNA fragments by Wizard® SV Gel and 
PCR Clean-Up System as per manufacturer’s instructions. 


3. Ligate these fragments into pCR4Blunt-TOPO as per manu- 
facturer’s instructions. 


Make a construct for a circularly permuted Vy; (cpViz), whose new 
termini are generated between Pro41 and Gly42 (as described 
according to Kabat numbering) while the native N- and C-termini 
are connected via a flexible linker (G,4S)3;. Considering the possible 
lesser stability of Viz fragment, two Cys residues are added at the 
N- and C-termini of cpVy to promote the correct folding and 
increased stability through the disulfide bridge. 


1. Amplify the DNA fragment encoding Gly42 to Ser113 of Vy 
with a linker peptide (G,S)3; at the C-terminus by PCR using 
the primers NcoCcpVHrev and VHG4Sfor, pET26/Fv-cpBLA 
as a template, and KOD FX DNA polymerase. 

2. Amplify the DNA fragment encoding Glul to Pro41, described 
according to Kabat numbering, with the linker at the 
N-terminus by PCR using G4S3VHrev and cpVHCSpefor as 
primers. 


3. Purify these fragments by Wizard® SV Gel and PCR Clean-Up 
System. 


4. Perform overlap-extension PCR with the gel-purified PCR 
products (20 ng each) and KOD FX for 15 cycles without 
primers and 35 cycles with NcoCcpVHrev and cpVHCSpefor 
primers. 


5. Purify these fragments by Wizard® SV Gel and PCR Clean-Up 
System. 


6. Ligate into pCR4Blunt-TOPO. 
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3.3 Construction 
of the cpV, Gene 


3.4 Construction 
of the cpBLA(RG13) 
Gene 


3.5 Construction 
of Cbody-cpBLA 
Fusion Protein 
Expression Vectors 


Similarly, make a construct for the circularly permuted V;, (cpV;) 
with new termini between Pro40 and Gly41. In this case, two Cys 
residues at the N- and C-termini of cpV; are not added to avoid 
unfavorable interdomain crosslinking at the later stage. 


1 


6. 


. Amplify the DNA fragment encoding Gly41 to Arg108 of V; 


with a linker peptide (G,S)3 at the C-terminus by PCR using 
the primers AgecpVLrev and VLG4Sfor, pET26/Fv-cpBLA as 
a template, and KOD FX DNA polymerase. 


. Amplify the DNA fragment encoding Asp1 to Pro40 with the 


linker at the N-terminus by PCR with G4S3VLrev and cpVL- 
Notfor as primers. 


. Purify these fragments by Wizard® SV Gel and PCR Clean-Up 


System. 


. Perform overlap-extension PCR with the gel-purified PCR 


products (20 ng each) and KOD FX DNA polymerase for 
15 cycles without primers and 35 cycles with AgecpVLrev and 
cpVLNotfor primers. 


. Purify these fragments by Wizard® SV Gel and PCR Clean-Up 


System. 
Ligate into pCR4Blunt-TOPO. 


To make a fusion protein comprising Clampbody and cpBLA, first 
make a construct for the cpBLA with known 3D structure that 
Guntas et al. reported [23, 24]. 


1. 


2. 


4. 


Amplify the DNA fragment encoding Trp227 to Trp286 of 
TEM-1 £-lactamase with a short peptide linker (GSGGS) at 
the C-terminus by PCR using the primers InfVHSpeRG13rev 
and RG13OLI for, pIT2/311J3 asa template, and PrimeSTAR® 
Max DNA Polymerase. 


Amplify the fragment encoding His24 to Gly226 with the 
linker at the N-terminus by PCR with RG13OL2rev and 
RG13AgeVLInffor as primers. 


. Purify these fragments by Wizard® SV Gel and PCR Clean-Up 


System. 

With the gel-purified PCR products (20 ng each), perform 
overlap-extension PCR with PrimeSTAR® Max DNA 
Polymerase for 15 cycles without primers and 35 cycles with 
InfVHSpeRG13rev and RG13AgeVLInffor primers. 


. Purify these fragments by Wizard® SV Gel and PCR Clean-Up 


System. 


Make a construct for a fusion protein comprising Clampbody and 
cpBLA by connecting cpVy and cpV, to the N- and C-termini of 
cpBLA [7] with a minimum number of amino acid residues 
(Clampbody-cpBLA, Cbody-cpBLA) (Fig. 1b, c). 


3.6 Construction 
of scCb Expression 
Vectors 


3.7 Expression 
and Purification 
of Clampbodies [6, 25] 
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1. Amplify the DNA fragment encoding Neol, Spel, Agel, and 
NotI sites by PCR using the primers MCScpBLArev and 
cpBLAMCSfor, pET26/Fv-cpBLA as a template, and KOD 
FX DNA polymerase. 


2. Purify these fragments by Wizard® SV Gel and PCR Clean-Up 
System. 


3. Digest the gel-purified PCR product (2 ppg) with Neol HF and 
Notl HF. 


4. Purify the resulting DNA fragments using the Wizard® SV Gel 
and PCR Clean-Up System and insert into pIT2(13CG2) 
digested with the same, resulting in pIT2/MCS. 


5. Digest the cpVy gene with Neo I and Spe I. 

6. Purify these fragments by Wizard® SV Gel and PCR Clean-Up 
System and inserted into pIT2/MCS digested with the same. 

7. After the cpVy insertion, digest the cpV; gene with Age I and 
Not I. 


8. Purify these fragments using Wizard® SV Gel and PCR 
Clean-Up System and inserted into cpVy-inserted pIT2/ 
MCS. 


9. Digest these plasmids with Neol and NozI. 
10. Purify these fragments by Wizard® SV Gel and PCR Clean-Up 


System and insert into pET30b(+) digested with the same, 
resulting in pET30b/cpVH-cpVL. 
11. Insert the cpBLA(RG13) gene into pET30b/cpVH-cpVL 


with In-Fusion® HD Cloning Kit according to the manufac- 
turer, resulting in pClamp-cpBLA. 


Prepare a single-chain protein construct where cpVy and cpVy are 
connected by a short linker G,S (single-chain Clampbody, scCb) 
to evaluate the antigen-binding activity of the Clampbody. 


1. Anneal TSG4TGrev and TSG4TGfor in Tris-EDTA buffer to 
prepare the DNA fragment encoding the SG, linker. 


2. Insert the fragment into pClamp-cpBLA digested with Agel 
and Spel, resulting in pClamp-SG4. 


When Clampbody and Cbody-cpBLA are expressed in E. coli, the 
majority of the expressed proteins will accumulate in inclusion 
body even if cultured at lower temperature. In such cases, perform 
the following procedures to solubilize the insoluble Clampbodies. 


1. Transform SHuffle T7 Express lysY competent cells with 
expression vectors (see Note 1). 
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. Incubate the transformed E. cold on a LB medium containing 


antibiotics (50 pg/mL kanamycin) (LBK) and 1.5% agar at 
37 °C for 24 h. 


. Pick one colony up and culture it in 4 mL LBK medium at 


37 °C. 


. Inoculate the 1 mL of cultured cells to 250 mL of LBK medium 


in a 1000 mL flask, and culture in 37 °C. 


. When the OD6o9 reaches 0.4-0.6, add to a final concentration 


of 0.4 mM of IPTG, and culture the cells at 16 °C overnight. 


. Centrifuge at 10,000 x g for 15 min at 4 °C to collect the cells 


(see Note 2). 


. Resuspend the pellet with 30 mL of TALON extraction buffer 


and sonicate the E. coli on ice using a supersonic homogenizer 
VP-15S for 2 min (50% interval) three times. 


. Centrifuge the sonicated solution at 10,000 x g for 60 min at 


4 °C (see Note 3). 


. Resuspend the pellet with 25 mL of distilled water containing 


1% Triton-X100 and 1 mM EDTA. 


. Centrifuge at 75,000 x g for 15 min at 4 °C. Resuspend pellet 


with 25 mL of distilled water containing 1% Triton-X100 and 
1 mM EDTA. 


. Repeat step 11 to wash the pellet. 
. Resuspend the pellet with 10 mL of TALON extraction buffer 


containing 6 M guanidium hydrochloride (Gdn-HCl) to 
solubilize it. 


. Centrifuge at 75,000 x g for 15 min at 4 °C and collect the 


supernatant carefully. 


. Add 200 pL of TALON Metal Affinity Resin to the supernatant, 


and rotate or shake gently for 20 min (see Note 4). 


. Centrifuge at 700 x g for 2 min, and resuspend the resin with 


10 mL of TALON extraction buffer containng 6 M 
Gdn-HCl. 


. Rotate or shake gently for 10 min. 

. Repeat steps 11 and 12 twice. 

. Centrifuge at 700 x g for 2 min, and collect the resin. 

. Suspend the resin with 3 mL of TALON extraction buffer con- 


taining 6 M Gdn-HCl, and put the suspension into a TALON 
2-mL Disposable Gravity Column. 


. Allow to drain until it reaches the top of the resin bed. 
. Wash the column with 2 mL of TALON extraction buffer 


containing 6 M Gdn-HCl. 


. Elute the protein with 1.5 mL of TALON elution buffer 


containing 5.4 M Gdn-HCl. 


3.8 Refolding 
of the Solubilized 
Proteins 


3.9 Western Blotting 
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Refold the purified denatured Clampbodies by stepwise dialysis 
procedure as below. 


1 


10. 


. Add 1.05 pL of 2-mercaptoethanol and 3 pL of 0.5 M EDTA 


to 1.5 mL of purified solubilized protein. 


. Rotate gently at room temperature for 30 min to reduce disul- 


fide bonds. 


. Inject protein solution into Slide-A-Lyzer Dialysis Cassettes, 


3.5 K MWCO, 3 mL. 


. Dialyze protein solution by putting the Slide-A-Lyzer Dialysis 


Cassettes into 1 L of Dialysis buffer containing 3 M Gdn-HCL. 
Incubate at 4 °C for overnight (see Note 5). 


. Dialyze protein against Dialysis buffer containing 2 M Gdn- 


HCl at 4 °C overnight. 


. Dialyze protein against Dialysis buffer containing 1 M Gdn-HCl 


at 4 °C overnight. 


. Dialyze protein against Dialysis buffer containing 0.5 M 


Gdn-HCl at 4 °C overnight. 


. Dialyze protein against Dialysis buffer without Gdn-HCl at 4 °C 


overnight. 


. Collect the contents in the cassette and centrifuge at 20,000 x g 


for 5 min at 4 °C. 


Collect the supernatant (see Note 6). Measure the concentration 
of purified protein with a Pierce™ BCA protein assay kit as per 
manufacturer’s instructions. 


After the preparation of Clampbodies, perform SDS-PAGE and 
Western blotting to confirm the purity and the amount of the 
proteins (Fig. Ic, d). 


ds 


Analyze the proteins by SDS-PAGE using a 5—20% gradient 
gel and SDS running buffer. 


. Transfer the protein from the gel to a PVDF membrane with 


Transfer buffer. 


. Incubate the membrane in TBST-S at 4 °C overnight. 
. Wash membrane three times with TBST. 


. Incubate the membrane with HRP-conjugated anti-His, anti- 


body 4000-times diluted in TBST at 4 °C for 1 h. 


. Wash membrane three times with TBST. 


. Soak membrane in Amersham ECL Prime as per manufactur- 


er’s instructions. 


. Detect the luminescence using a  LAS-4000 mini 


Luminoimager. 
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Fig. 2 Antigen-binding activity of Fv, sc-Cbody, and Cbody-cpBLA. (a) Binding of parental Fv (red triangle) and 
sc-Cbody (blue circle) to the biotinylated BGP-C11 antigen immobilized at the indicated concentrations. (b) 
Binding of Cbody-cpBLA to the biotinylated BGP-C11 immobilized at 0.9 sM. Blue and red bars indicate the 
binding signals to antigen-positive and negative wells, respectively. Averages of three samples with an error 
bar of 1 SD are shown. Reproduced from Iwai et al. [10] with permission from American Chemical Society 


3.10 ELISAs Perform following ELISAs to confirm that the refolded 
of Clampbodies Clampbodies retain the antigen-binding activity derived of the 
parental Vj; and V; domains (Fig. 2a, b). 


1. Apply 50 pL per well of 10 pg/mL streptavidin in microplate 
wells at 4 °C overnight. 


2. Discard the supernatant carefully, and then apply 200 pL/well 
of PBST supplemented with 20% Immunoblock at room tem- 
perature for 1 h. 


3. Discard the supernatant carefully, and wash wells with PBST 
three times. 


4. Apply 50 pL per well of 1 pg/mL biotinylated BGP-C11 pep- 
tide in PBST containing 5% Immunoblock at room tempera- 
ture for 1 h. 


5. Discard the supernatant carefully, and wash wells with PBST 
three times. 


6. Apply 50 pL per well of 100 nM protein samples in PBST con- 
taining 5% Immunoblock at room temperature for 1 h. Use 
approximately 1 pM protein stock and dilute it in PBST con- 
taining 5% Immunoblock to make 100 nM working solution 
immediately before the assay. 


7. Discard the supernatant carefully, and wash wells with PBST 
three times. 


8. Apply 50 pL per well of HRP-conjugated anti-His, antibody 
4000-times diluted in PBST containing 5% Immunoblock at 
room temperature for | h. 


3.11 Clampbody 
Activity Assay 
with Fluorocillin 
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9. Discard the supernatant carefully, and wash wells with PBST 
three times. 


10. Apply 50 pL per well of TMBZ reaction buffer at room tem- 
perature. After several minutes, add 50 pL per well of 10% 
H,SO, to stop the reaction. 


11. Detect the absorbance at 450 nm with a reference absorbance 
at 650 nm. 


Measure catalytic activity of Cbody-cpBLA using a fluorogenic 
substrate fluorocillin (Fig. 3). The antigen-dependency of Cbody- 
cpBLA becomes more significant in reaction buffers containing 
appropriate amount of denaturant or detergent such as urea and 
Triton X-100. 


a PBS b 1M Urea 


t(min) 
c 0.1% Triton X-100, d 
osop 200 U/ML BSA 
§7|-a-- 7 
. 
2000 B4 
§ 
= ee 3 5 
LL oO 
© 1000 £2 
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® 
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0 3 6600s 120 0 04 #14 10 100 1000 
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Fig. 3 Antigen-dependent catalytic activity of Cbody-cpBLA. The solution of 
70 nM Cbhody-cpBLA, 1 g/mL BGP-C10, and 1 pM fluorocillin was mixed in (a) 
PBS, (b) PBS containing 1 M urea, and (c) PBS containing 0.1% Triton X-100 and 
250 pg/mL BSA, incubated at 30 °C, and read for fluorescence. Background fluo- 
rescence of the samples without Cbody-cpBLA was subtracted for each condi- 
tion. Averages of three samples with an error bar of 1 SD are shown. Blue circle 
indicates the catalytic activity in the presence of 1 g/mL BGP-C10, and red tri- 
angle indicates in the absence of peptide. (d) Dose — response curves for BGP- 
C10 and BGP-C10dV peptides in the reaction buffer as in (c). The reaction rates 
at each peptide concentration are shown as the relative value to the rate in the 
absence of peptides. Statistical analysis was conducted using the two-tailed 
unpaired Student’s f test: («) p < 0.05, (**) p < 0.01. Reproduced from Iwai 
et al. [10] with permission from American Chemical Society 


162 Hiroto Iwai et al. 


3.12 Clampbody 
Activity Assay 
with Nitrocefin 


4 Notes 


. Prepare BGP-C10 (NH,-EAYRRFYGPV-COOH) and BGP- 


Cl0dV (NH,-EAYRRFYGP-COOH) as antigen and non- 
antigen peptide, respectively. 


. Mix 25 pL per well of 140 nM protein samples in PBS and 


25 pL per well of antigen peptides in PBS. Incubate for 
30 min at room temperature in a black microplate. Use 
approximately 1 1M protein stock and dilute it in PBS imme- 
diately before the assay. 


. Apply 50 pL per well of 2 1M fluorocillin in PBS containing 


urea, TritonX- 100, Tween-20 or BSA, at varying concentrations. 
Use 1 mM fluorocillin stock solution in DMSO and dilute it in 
PBS immediately before the assay. 


. Immediately read for their fluorescence intensity at 535 nm 


with 485 nm at 30 °C for 2 h using a fluorescence plate reader 
GENios Pro. 


. Fit the initial rates of reactions to the Michaelis-Menten equa- 


tion as follows: 
v= Vmax[S]/(Km+[S]) 
by the nonlinear least squares algorithm of Kaleidagraph 4.1 


(Synergy Software, Reading, PA, USA). 


Evaluate the catalytic activity of Cbody-cpBLA using a colorimet- 
ric substrate nitrocefin, as shown in Fig. 4. 


1. Mix 25 pL per well of 140 nM proteins in PBS and 25 pL per 


well of antigen samples in PBS. Incubate for 30 min at room 
temperature in transparent microplate. Use approximately 
1 uM stock protein and dilute this in PBS immediately before 
the assay. 


2. Apply 50 pL per well of 0-500 1M nitrocefin in PBS contain- 


ing 2.4 Murea. Use 20 mM nitrocefin stock solution in DMSO 
and dilute it in PBS immediately before the assay. 


. Immediately read for their absorbance at 486 nm at 30 °C for 


1 h using a microplate reader SH-1000. 


. Calculate the catalytic parameters following the same proce- 


dures in Subheading 3.11. 


. SHuffle T7 Express lysY is an engineered E. coli B strain to 


express proteins containing disulfide bonds in the cytoplasm. 
Constitutively expressed disulfide bond isomerase DsbC 
promotes the correction of mis-oxidized proteins into their 
correct form, and also works as a chaperone that can assist in 
the folding of proteins that do not require disulfide bonds. 


v (mirr) 


0 50 100 
Nitrocefin (uM) Nitrocefin (uM) 


150 
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Fig. 4 S-V plots for BLA and Cbhody-cpBLA. (a) 2 nM BLA or (b) 70 nM Cbody-cpBLA was added with nitrocefin 
substrate in the presence or absence of antigen 1 uM BGP-C10, in PBS containing 1.2 M urea, incubated at 
30 °C, and read for absorbance. Background absorbance of the samples without BLA or Cbody-cpBLA was 
subtracted for each condition. Initial velocities are plotted against the substrate concentration. Averages of 
three samples with an error bar for 1 SD are shown. Fitted curves based on the Michaelis-Menten equation are 
shown for each condition. Statistical analysis was conducted using the two-tailed unpaired Student’s é-test. 
(x) p < 0.01. Reproduced from Iwai et al. [10] with permission from American Chemical Society 


. Culture supernatant (2 mL) can be precipitated with 10% 


trichloroacetic acid, and washed twice with cold acetone, and 
resuspended in 20 pL of PBS and used for SDS-PAGE. 


. The supernatant including soluble protein should be kept for 


SDS-PAGE. 


. Before adding to the solubilized protein, TALON metal affinity 


resin should be washed twice with TALON extraction buffer 
containing 6 M Gdn-HCl. 


. Keep stirring the buffer using a magnetic stirrer. 


. The purified enzyme in the buffer containing 15% glycerol can 


be stored for several months at —80 °C. The concentration of 
the proteins in stock solution was typically 1 1M in our hands. 
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Chapter 11 


Protein and Protease Sensing by Allosteric Derepression 


Hui Chin Goh, Farid J. Ghadessy, and Saurabh Nirantar 


Abstract 


Peptide motifs are crucial mediators of protein-protein interactions as well as sites of specific protease activity. 
The detection and characterization of these events is therefore indispensable for a detailed understanding 
of cellular regulation. Here, we present versatile and modular sensors that allow the user to detect protease 
activity and protein-peptide interactions, as well as to screen for inhibitors using chromogenic, fluorescent, 
or luminescent output. 


Key words Protease sensors, Protein-protein interaction, Drug screening 


1. Introduction 


Sensing the binding of macromolecules to and cleavage of linear 
peptide motifs is of great importance in elucidating the many regu- 
latory processes in the cell. Several interaction-mediating domains 
like the PDZ or SH2 families bind their cognate peptide to enable 
colocalization of interacting proteins [1-3]. Similarly, processes as 
diverse as viral maturation and blood clotting depend on the rec- 
ognition of a specific peptide sequence by their target proteases 
such as thrombin or tobacco etch virus (TEV) protease [4, 5]. 

In vitro studies with such peptides are very useful in detecting 
and characterizing these functions. Typically, in vitro characteriza- 
tion of such events relies on techniques such as fluorescence polar- 
ization (FP) [6], enzyme-linked immunosorbent assays (ELISA) 
[7], surface plasmon resonance (SPR) [8], and Forster resonance 
energy transfer (FRET) [9]. While powerful, these methods require 
substantial instrumentation and preparation and suffer from a num- 
ber of limitations. These include interference from autofluorescent 
molecules in case of FP measurements, multiple washing and 
incubation steps in case of ELISAs, or low throughput for SPR. 

Protein engineering-based methods have been developed to 
address this need. These typically involve the insertion of a relevant 
peptide or protein domain into a read-out capable enzyme such 
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that binding of the peptide or cleavage thereof leads to a change in 
parameters like enzyme activity, thus detecting the event [10-12]. 
However, these systems are typically custom-made and require 
substantial empirical optimization. There is therefore a need for 
simpler, modularly organized systems that allow the user to study 
these processes. 

We have developed such a system based on auto-inhibited 
enzymes. The basic structure of a sensor is depicted in Fig. 1. 
Briefly, it comprises a read-out enzyme (e.g., Tem] B-lactamase) 
fused via a flexible peptide linker that incorporates the peptide of 
interest, to a domain that binds and inhibits the read-out enzyme. 
The flexible peptide retains the inhibitor in close proximity to the 
read-out enzyme resulting in a high local concentration, and allows 
the inhibitor domain to reach out and bind the read-out module, 
thereby inhibiting it. 

For the detection of proteases, cleavage of the peptide of interest 
in the flexible linker allows the dissociation of the inhibitor domain 
and its diffusion into the bulk solution leading to derepression and 
signal generation by the read-out enzyme [13]. 

In the case of peptide-protein binding events, the peptide of 
interest is included in the flexible linker such that binding ofa large 


of 
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Fig. 1 The sensor is a recombinant fusion protein comprising of a read-out capa- 
ble enzyme (gray), and an inhibitor of the enzyme (b/ack) connected by a linker. 
The peptide of interest is placed in the linker region. The close proximity of the 
enzyme and inhibitor enabled by being fused together via the linker leads to a 
high effective local concentration and strong inhibition. Left: Binding of the pep- 
tide of interest to a macromolecule (white hexagon) will interfere sterically with 
enzyme-inhibitor binding if the linker design is optimized. This leads to stabiliza- 
tion of the open state and turnover of substrate (white stars) leading to signal 
generation (gray stars). Right Upon cleavage of the peptide by a suitable prote- 
ase, the inhibitor is free to diffuse throughout the bulk solution, leading to de- 
repression of the enzyme and signal generation 


2 Materials 


2.1 Cloning 
of Expression 
Constructs 
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macromolecule like a protein domain to the peptide in the linker 
sterically hinders binding of the inhibitor to the read-out enzyme 
(Fig. 1). Thus, the presence of a peptide-interacting macromole- 
cule stabilizes the open state of the read-out enzyme and increases 
its activity. It is important that the flexible linker is sufficiently long 
to allow the inhibitor to bind to the read-out enzyme, but not long 
enough to allow a macromolecule to bind to the peptide without 
sterically hindering the interaction between the read-out enzyme 
and the inhibitor. In this case, the exact linker length requires 
empirical optimization. Sensitivity can also be enhanced in the case 
of bivalently binding macromolecules such as antibodies [13]. For 
instance, this can be achieved by placing a second protein-interacting 
peptide sequence in the sensor that can bind one arm of the anti- 
body molecule and thus tether the other arm in close proximity of 
the peptide in the linker. In this way, its local concentration is 
increased and thus leads to increased sensitivity. 

We have also developed another, more modular peptide-protein 
interaction detection methodology [14]. This involves a similar 
architecture, but instead of relying on the macromolecule sterically 
clashing with the read-out enzyme and inhibitor binding, the mac- 
romolecule binding to its cognate peptide now sterically interferes 
with proteolytic cleavage at a proteolytic cleavage site in close prox- 
imity to the peptide of interest. Thus, macromolecule binding pre- 
vents sensor activation by the protease. This system is particularly 
useful for drug screening as peptide-protein antagonist drugs will 
restore protease access and lead to signal generation [14]. 

Below is a detailed description of the construction and use of 
these sensors. 


1. Expression constructs can either be synthesized commercially 
(Genscript) or custom cloned. 


2. A plasmid coding for Tem1-BLIP D49A (see Note 1) inserted 
into the NdeI and Xhol sites of a pET28a vector with a 
C-terminal His6-tag (see Note 2). Here, BLIP stands for 
B-lactamase inhibitor protein while D49A refers to a mutation 
in BLIP that reduces affinity for Tem] [15]. The coding DNA 
sequence for a fusion of thermostable Tem] f-lactamase 
enzyme [16] and BLIP which is used as a template for insert- 
ing the desired peptide sequence between Teml and BLIP, 
and if necessary at the N- or C-termini as well, is given (see 
Note 3). Also, it is advisable to avoid Amp® cassette containing 
vectors (see Note 4). 


3. Standard reagents for high-fidelity PCR amplification: 10x 
reaction buffer, 10 mM each dNTP solution, and a thermo- 
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2.2 Expression 
of Recombinant 
Proteins 


10. 


11. 


12. 


stable DNA polymerase such as Pfu Turbo polymerase 
(e.g., Agilent). 


. Standard desalted oligonucleotide primers for PCR. The primer 


stock solution is 10 1M. For illustration, primers used for insert- 
ing a proteolytic cleavage site for SplB (available from 
ThermoFisher; also known as WELQut protease) between 
Teml and BLIP = are shown: SplB-ins forward: 
5’-TGGGAACT GCAGGGTTCTAGTGGTGGCGAA 
AAAATT-3’ and SplB-ins reverse: 5’-ACCCTGCAGTTC 
CCAACCACCGGCACCACTACCGCCCCAATG -3’. 


. PCR purification kit (e.g., QIAquick PCR purification kit, 
QIAGEN). 
. In-Fusion cloning kit (Clontech). The user may substitute 


another cloning method of choice instead. 


. Competent £. coli cells for transforming DNA: One Shot 


TOP10 Chemically Competent Cells E. coli (e.g., Invitrogen). 


. LB-Kan agar plates: 5 g/L yeast extract, 10 g/L peptone, 


10 g/L NaCl, and 15 g/L agar supplemented with 50 pg/mL 
kanamycin. 


. LB-Kan liquid medium: 5 g/L yeast extract, 10 g/L peptone, 


and 10 g/L NaCl supplemented with 50 pg/mL kanamycin. 
Commercial plasmid miniprep kit (e.g., Qiagen) or self-made 
reagents. 

Oligonucleotide primers suitable for sequencing newly con- 
structed plasmids. If using the pET28a vector, petF2 
(5’-CATCGGTGATGTCGGCGAT-3’) and pETR (5'-CGGA 
TATAGTTCCTCCTTTCAGCA-3’) are suitable for sequenc- 
ing from the N- and C-terminal ends, respectively. 


DNA Sequencing facility and suitable sequence analysis soft- 
ware (e.g., DNAStar Lasergene Suite or Snapgene). 


1. SHuffle T7 Competent E. coli (NEB). (See also Note 5. 


. LB-Kan agar plates: 10 g/L yeast extract, 20 g/L peptone, 


20 g/L sodium chloride, and 15 g/L agar supplemented with 
50 pg/mL kanamycin. Other non-ampicillin plates are also 
suitable for use with alternative vector systems. 


. LB-Kan liquid medium: 10 g/L yeast extract, 20 g/L pep- 


tone, and 20 g/L sodium chloride supplemented with 50 pg/ 
mL kanamycin. 


4. Stock solution for inducing protein expression: 1 M IPTG. 


. Erlenmeyer flasks and incubator-shaker for bacterial cell 
culture. 
. Centrifuge for pelleting bacterial cells (e.g. Beckman 


ultracentrifuge). 


2.3 Purification 
of Expressed Protein 


2.4 Biosensor Assay 


3 Methods 


3.1 Plasmid 
Construction 


iS) 


nm ® wW 
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. Lysis and Binding buffer: 50 mM sodium phosphate, 300 mM 


sodium chloride, 30 mM imidazole, pH 7.4. 


. Elution buffer: 50 mM sodium phosphate, 300 mM sodium 


chloride, 500 mM imidazole, pH 7.4. 


. Sonicator or other means of cell lysis. 

. Syringes and 0.45 pm filter to clarify cell lysate. 

. 1 mL Nickel Sepharose His Trap column (GE Healthcare). 

. SDS-PAGE electrophoresis equipment to verify protein 


expression and quality. 


. 96- or 384-well transparent bottom plate (Greiner Bio) for 


absorbance reading. For alternative readouts based on fluores- 
cence or luminescence, other types of plates are required. 


. Phosphate buffered saline (PBS) as diluent: 10 mM Na,HPOu, 


2 mM KH,PO,, 137 mM NaCl, and 2.7 mM KCl, pH 7.4. 


. 1 mM nitrocefin substrate stock solution: 1 mg _ nitrocefin 


(Merck) dissolved in 100 pL DMSO and 1.9 mL PBS as per 
manufacturer’s instructions. 


. Multi-well plate reader capable of reading OD at 492 nm. 


Target protease or target protein binder whose cleavage or 
binding site is present in the biosensor linker. By way of exam- 
ple, enterokinase (NEB), TEV protease, and anti-HA tag anti- 
body have been successfully detected [13]. 


. After the gene coding for Tem1-BLIP (see Note 3) has been 


synthesized, it may be used as a standard template for inserting 
suitable peptides as required. We generally use custom gene 
synthesis to obtain the backbone, but the plasmid may also be 
obtained from our lab upon request. For cloning, we typically 
use inverse PCR with a reverse primer binding at the 3’ end 
of the Tem] ORF and a forward primer binding at the 5’ end 
of the BLIP ORF while the 5’ tails encode the desired peptide 
and linker amino acids. Furthermore, the 5’ fifteen bases of the 
primers are complementary to each other to enable Infusion 
cloning. Alternatively, if the sequence of the insert is too long, 
the desired plasmid can be constructed by means of serial PCR 
using overlapping primers. We frequently employ primers that 
have been gel purified, especially if the oligonucleotide is 
greater than 40 bases to avoid single-base dropouts and 
frameshifts. 


. A typical PCR reaction based on Pfu Turbo polymerase typically 


contains in a total volume of 50 pL (Pfu Turbo polymerase is 
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3.2 Protein 1. 


Expression 
and Purification 


typically only added to the PCR when the denaturation 
temperature has been reached to reduce mispriming): 


5 pL 10x reaction buffer. 

1 pL 10mM each dNTP solution. 

1 pL primer SpIB-ins forward (10uM stock). 
1 pL primer SpIB-ins reverse (10uM stock). 
3 pL DMSO. 

1-10 ng DNA template. 

1 pL Pfu Turbo polymerase. 

H,0 added to a final volume of 50 pL. 


. The backbone is typically amplified using the following PCR 


cycling protocol: 95 °C for 3 min, [95 °C for 10 s, 52 °C for 
10 s, 68 °C for 7 min] x 30 cycles, 72 °C for 5 min. 


. Following PCR, purify the reaction using a commercial PCR 


purification kit as per manufacturer’s instruction, and verify the 
resulting DNA product by analytical agarose gel electrophoresis. 
The DNA product should be ~7 kb when using Tem1-BLIP 
inserted into the pET28a vector backbone. 


. Add a suitable amount of purified PCR product into an infusion 


cloning reaction according to manufacturer’s instructions. The 
complementary 5’ fifteen bases on both forward and reverse 
primers will enable an inter-molecular infusion reaction that will 
lead to the formation of the desired plasmid. Dilute 5x with 
water and transform into One Shot TOP10 Chemically 
Competent Cells as per manufacturer’s instructions before plat- 
ing them on LB-Kan agar plates and incubating overnight. 


. Inoculate a few of the resulting colonies in LB-Kan liquid 


medium overnight and prepare plasmid DNA using a commer- 
cial miniprep kit according to manufacturer’s instructions. 


. Sequence the resulting plasmids using primers pETF2 and 


pETR and ensure the sequence is correct. 


Transform the verified plasmid into SHuffle T7 Competent E. 
cola as per manufacturer’s instructions. Inoculate the resulting 
colonies overnight at 30 °C. The next day, dilute this culture at 
1% into 800 mL LB-Kan liquid medium at 30 °C. Induce with 
IPTG (0.4mM final) when OD reaches 0.5—-0.8 followed by 
overnight incubation at 16 °C. Pellet the cells by centrifuga- 
tion. The cells can be frozen at —80 °C for later processing. 


. Resuspend the cells in 10 mL Lysis and Binding buffer in a 


50 mL Falcon tube and place on ice. Lyse cells by sonication 
for 10 min using the following pulsing sequence: 5 s sonication 
at amplitude 35 followed by 10 s rest. Centrifuge at 4 °C at 
10,000 x g for 10 min and collect the supernatant. Aspirate the 


3.3 Biosensor 
Assays 


3.3.1 Protease 
and Macromolecule- 
Binding Assay 
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supernatant into a syringe and attach a 0.45 pm filter to the 
end. Filter the supernatant to remove large particles. Replace 
the filter if it becomes clogged. 


. Equilibrate a 1 mL Nickel Sepharose His Trap column with 


Lysis and Binding buffer by washing the column with 10 col- 
umn volumes. Load the clarified protein lysate through the 
column. Keep an aliquot of the lysate and flow-through for 
analysis. Wash the column with 10 column volumes of Lysis 
and Binding buffer and collect an aliquot for analysis. Elute 
with 10 mL of Elution Buffer. Load equal volume fractions of 
all collected aliquots on a SDS-PAGE gel to estimate the effi- 
ciency of purification. Add glycerol to a final concentration of 
50% and store at —20 °C. 


. Alternatively, if using an FPLC instrument, equilibrate column 


with 10 column volumes of Lysis and Binding buffer. Clarified 
lysate is loaded into the system at a flow rate of 1 mL/min. 
Wash column with 10 column volumes of Lysis and Binding 
buffer. Samples are eluted using gradient length of 25. The 
entire run can be done at a flow rate of 1 mL/min with the 
pressure kept below 0.30 MPa. 


. The assays are generally carried out at room temperature in a 


384-well transparent bottom plate. 


. The exact concentration of the sensor and protease or macro- 


molecular binder must be determined empirically. Before start- 
ing the assay, test a range of sensor concentrations with varying 
concentrations of protease or macromolecular binder to deter- 
mine the point of optimal sensitivity. 


. When setting up multiple reactions, it is desirable to make a 


master mix containing the common components. The typical 
reaction in a total volume of 25 pL is set up as follows. 


. Add a suitable volume of PBS that will eventually make up a 


total reaction volume of 25 wL (1.e., we add PBS first to pre- 
vent small volumes of other reagents from sticking to the side 
of the well or remaining stuck to the pipette tip). 


. Add 2 wL of diluted sensor to a final concentration of approxi- 


mately 5 nM and mix with 2-5 pL of various dilutions of protease 
or macromolecular binder. The final optimal concentration of 
components needs to be determined empirically. For enterokinase 
or TEV protease, the threshold of detection is approximately in 
the mid-picomolar to low-nanomolar range, respectively. 


. Initiate the reaction by adding 5-7 pL 1 mM nitrocefin 


substrate stock solution to a final concentration of 250 1M 
(see Notes 6 and 7). 


. After the addition of nitrocefin, immediately insert the plate 


into a plate reader and measure the OD at 492 nm every 2 min. 
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3.3.2 Protease 1 


Exclusion Assay 


We typically measure enzyme activity by subtracting the 
OD492 value of read 3 from read 5 to determine substrate 
turnover/time (see Note 8). 


. The protease exclusion assay is carried out as above with the 


important exception that the macromolecular binder is added 
to the sensor and allowed to bind before the protease is added. 
This prevents premature cleavage of the sensor by protease 
before the macromolecule has an opportunity to bind and pro- 
tect the sensor. 


2. The typical reaction is set up as follows (see Note 9): 


4 Notes 


. Add a suitable volume of PBS that will eventually make up a total 


reaction volume of 25 pL (see step 4 in Subheading 3.3.1). 


. Add 2 pL of diluted sensor to a final concentration of approxi- 


mately 5 nM and mix with 2-5 pL of a macromolecular binder. 
This should be in excess of the sensor concentration to fully 
block all sensor molecules. The exact ratio will depend on the 
K, and should be confirmed empirically. Ideally, these compo- 
nents should be part of a master mix. 


. Add 2 pL of the macromolecular antagonist. The concentra- 


tion of the antagonist should be at least as much as the macro- 
molecule to allow complete dissociation. 


. After adding the macromolecular binder, incubate for 5-10 min 


to allow an equilibrium to be reached (see Notes 10 and 11). 


. Add 2 yL of protease. The optimal concentration needs to be 


determined empirically, e.g., for enterokinase we found 
~0.15 nM to be suitable. 


. Initiate the enzyme reaction by adding 5-7 pL 1 mM nitroce- 


fin substrate stock solution to a final concentration of 250 uM 
(see Notes 6 and 7). 


. Analyze the reaction by plotting the rate of substrate turnover 


against the concentration of the antagonist or blocking 
macromolecule. 


. In some cases, the affinity of BLIP D49A for Teml may be 


modulated for improved performance. So reverting BLIP to WT 
or introducing additional mutations like W150A may be helpful. 
Wang et al. [19] and Reichmann et al. [20] provide a spectrum 
of mutations that could be tested for this purpose. 


. This concept can be extended to enzyme-inhibitor pairs other 


than Tem1-BLIP. When constructing alternative enzyme- 
linker-inhibitor ORFs, we would recommend placing purifica- 
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tion tags like His6-tag on the same side as the inhibitor. This 
ensures that if truncated fragments are present in the cell lysate, 
it is the inhibitor and not free enzyme which is pulled down. 
Free enzyme can lead to a high background. 

. The DNA sequence coding for Tem1-BLIP is given below. For 
expression, it may be inserted into pET28a NdeI XhoI sites: 
5’-aaaaaaCATATGCACCCGGAAACCCTGGTTAAAGTCA 
AAGATGCCGAAGACCAGCTGGGTGCACGCGTTGGC 
TATATTGAACTGGATCTGAACTCAGGCAAAATCCTG 
GAATCGTTTCGTTCTGAAGAACGCTTCCCGATG 
ATGTCAACCTTTAAAGTTCTGCTGTGCGGTG 
CTATCCTGTCGCGTATTGATGCGGGTCAGGAA 
CAACTGGGCCGTCGCATCCATTATTCACAGAAT 
GATCTGGTCGAATACTCGCCGGTGACCGAAAAA 
CACCTGACCGACGGCATGACGGTGCGCGAACTGT 
GTAGCGCCGCAATTACCATGTCTGATAACACGGCT 
GCGAATCTGCTGCTGACCACGATCGGTGGT 
CCGAAAGGCCTGACCGCCTTCCTGCATAACATGGG 
TGATCACGTTACGCGTCTGGACCGCTGGGAA 
CCGGAACTGAACGAAGCCATTCCGAATGA 
TGAACGTGACACCACGACCCCGGTGGCAATGG 
CAACCACGCTGCGCAAACTGCTGACCGGCGAAC 
TGCTGACGCCGGCGAGTCGTCAGCAACTGA 
TGGATTGGATGGAAGCCGACAAAGTGGCAGGT 
CCGCTGCTGCGTAGCGTGCTGCCGGCAGGTT 
GGTTTATCGCAGATAAAAGTGGTGCAGGCGAA 
CGTGGTTCCCGCGGCATTGTGGCTGCAC 
TGGGTCCGGATGGCAAACCGAGTCGCATT 
GTGGTTATCTATACCACGGGCTCCCAGGCGA 
CCATGGATGAACTGAATCGCCAAATTG 
CTGAAATCGGTGCGAGCCTGATCAAACAT 
TGGGGCGGTAGTGGTGGCGAAAAAATTCG 
TCTGCGTGGATCCGCTGGTGTTATGACCGG 
CGCGAAATTTACGCAGATTCAATTCGGTAT 
GACCCGTCAGCAAGTCCTGGATATCGCCGGCG 
CAGAAAACTGCGAAACCGGCGGTAGCTTTGGT 
GACTCTATTCATTGTCGTGGTCACGCAGCAGGT 
GCATATTACGCGTATGCAACCTTTGGCTTCACG 
AGCGCAGCTGCGGATGCCAAAGTGGACTCAAAAT 
CGCAGGAAAAACTGCTGGCTCCGTCTGCACCGACC 
CTGACGCTGGCAAAATTCAATCAGGTCACCGTGGGT 
ATGACGCGTGCTCAGGTGCTGGCGACCGTTGG 
TCAAGGCAGTTGCACCACGTGGTCCGAATATTAC 
CCGGCTTACCCGAGCACCGCAGGTGTGACGCTGA 
GTCTGTCTTGTTTTGATGTTGACGGTTATAGCTCT 
ACCGGTTTCTACCGTGGCTCCGCGCATCTGTGGTTT 
ACGGATGGTGTCCTGCAGGGCAAACGCCAATGGGA 
CCTGGTGGGCGGTGGGAGCTCCTATCCGTACGAT 
GTTCCGGACTACGCCTAACTCGAGaaaaaa- 3’. 
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4, 


10. 


ll. 


pET28a or other non-ampicillin-based resistance marker 
plasmids are recommended because Tem] is produced by the 
ampicillin resistance gene. This can interfere with or contami- 
nate the recombinant sensor. Also ampicillin is a competitive 
inhibitor of the Tem1 enzyme, so it should be ensured that the 
reaction is free of it. 


. These cells are recommended as Tem] and BLIP are disulfide 


bond containing proteins and need an oxidizing environment 
for their formation. SHuffle T7 Competent E. coli are modi- 
fied to have an oxidizing environment in their cytoplasm, so 
disulfide bond containing protein can be expressed therein 
[17]. Alternatively, expression in the periplasm or re-folding 
from denatured proteins can be considered. 


. Nitrocefin is light sensitive and light exposure should be 


minimized. 


. Nitrocefin is known to be turned over in highly reducing or 


oxidizing environments [18]. We have found that the presence 
of strong reducing agents like DTT or -mercaptoethanol in 
the storage buffer of some macromolecules can lead to rapid 
enzyme-independent turnover of nitrocefin. This should be 
tested during the initial optimization. If this is found to be the 
case, the relevant buffer should be exchanged before adding to 
the reaction. 


. The plate reader we use (Perkin Emler Envision) takes 1-2 


reads to stabilize the signal. We have been using read 3 as our 
initial measurement point. Also, we typically allow the reaction 
to proceed for about 30-60 min to ensure that it is complete. 


. It is highly recommended that a master mix comprising PBS, 


the sensor, and the macromolecular binder is set up as this 
saves time and allows time for the sensor and macromolecule 
to bind each other. We typically set up the reaction on ice to 
minimize potential thermal unfolding. 


It is recommended that in each assay, suitable controls are 
included. These can include the protease and the sensor, but 
without the sterically interfering macromolecular binder to 
confirm that both sensor and protease are functional. 


Some of the potential inhibitors we tested were colored and 
had a substantial OD492 value. This can generally be dealt 
with by normalizing the sensor reaction against a control reac- 
tion with the potential inhibitor and other components except 
the sensor itself, or by subtracting the OD at read 1 from all 
subsequent reads. 
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Chapter 12 


DNA-Specific Biosensors Based on Intramolecular 
§-Lactamase-Inhibitor Complex Formation 


Wouter Engelen and Maarten Merkx 


Abstract 


Synthetic protein switches that sequence-specifically respond to oligonucleotide-based input triggers provide 
valuable tools for the readout of oligonucleotide-based biomolecular systems and networks. Here, we 
discuss a highly modular approach to reversibly control the DNA-directed assembly and disassembly of a 
complex between TEM1-f-lactamase and its inhibitor protein BLIP. By conjugating each protein to a 
unique handle oligonucleotide, the enzyme-inhibitor pair is noncovalently assembled upon the addition of 
a complementary ssDNA template strand, resulting in inhibition of enzyme activity. Hybridization of an 
input-oligonucleotide that is complementary to a target recognition sequence in the ssDNA template 
strand results in the formation of a rigid dsDNA helix that mechanically disrupts the enzyme-inhibitor 
complex, hereby restoring enzyme activity. Following this noncovalent approach allowed straightforward 
tuning of the ssDNA template recognition sequence and target oligonucleotide lengths with only a single 
set of oligonucleotide-functionalized enzyme and inhibitor domains. Using a fluorescent substrate, as little 
as 10 pM target oligonucleotide resulted in a distinguishable increase in enzyme activity. 


Key words f-lactamase, Reporter enzyme, DNA detection, Synthetic biology, Protein-DNA conjugation, 
Biosensor 


1 Introduction 


Synthetic protein switches are extensively used in synthetic biology, 
molecular imaging, and molecular diagnostics to study biological 
processes both 27 vitro and in vivo by responding to the presence 
of a specific input molecule. Ideally, molecular recognition (input) 
and signal generating functions (output) are part of separate 
domains in these switches, as such a modular organization allows 
straightforward exchange of input or output function [1]. A suc- 
cessful design approach that ensures efficient translation of input 
activation to output modulation, is to design switches that are able 
to adopt two conformational states; one in which two output 
domains can interact intramolecularly and one where their interaction 
is prohibited. The dynamic response of such a protein switch is 
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determined by the relative affinities of the output domain interaction 
and the magnitude of the input-induced thermodynamic change, 
e.g., the strength of ligand binding [2]. Examples of output 
domains that have been applied for these approaches include fluo- 
rescent or luminescent proteins domains forming FRET or BRET 
pairs. To ensure efficient energy transfer between them in either 
the on or off state, fluorescent domains can be used that have an 
increased tendency to form intramolecular complexes, or their 
close proximity can be promoted by so-called helper domains [3 ]. 
Our group used these self-associating fluorescent protein domains 
to develop robust FRET sensor protein for a range of analytes, 
including metal ions, bile acids, and antibodies [4-7 ]. In addition, 
by replacing the self-associating fluorescent domains by an enzyme- 
inhibitor pair with similar affinity, the optical readout of the anti- 
body FRET sensor could be replaced by a more sensitive enzymatic 
readout [8]. 

The modular organization of protein switches based on mutu- 
ally exclusive domain interactions does not only allow exchange of 
different output functions, but is also well suited to combine with 
oligonucleotide-based biomolecular systems and networks. Most 
approaches reported thus far to control enzyme activity by DNA are 
based on the templated assembly of split enzyme fragments [9, 
10]. Whereas split reporter enzymes have the advantage of provid- 
ing a low background activity (at least when used intermolecularly), 
split enzymes generally suffer from poor thermodynamic stability 
and protein complementation is often not reversible. Therefore, our 
group recently introduced an alternative approach that is based on 
the reversible assembly of a complex between the reporter enzyme 
TEM1-f-lactamase and its inhibitor domain BLIP [11-13]. This 
design exploits the inherent difference in mechanical properties 
between ssDNA and dsDNA to mechanically disrupt and spatially 
separate the enzyme-inhibitor pair upon binding of an input oligo- 
nucleotide to the input-binding module. To facilitate straightfor- 
ward exchange of input sequences, a noncovalent approach is 
followed by conjugating both TEM1--lactamase and BLIP to indi- 
vidual, 21 nucleotide handle oligonucleotides. By appending anti- 
handle sequences to each end of the synthetic ssDNA target 
recognition sequence, a ternary complex is formed upon mixing of 
the three protein switch components (Fig. 1). Formation of this ter- 
nary complex results in a large increase in effective concentration 
and hereby induces an intramolecular interaction between the 
enzyme-inhibitor pair. Hybridization of the complementary target 
oligonucleotide to the input module results in the formation of a 
rigid double-helix which mechanically disrupts the enzyme-inhibitor 
interaction, hereby switching on enzyme activity. 

Since this system is based on two well-folded and stable pro- 
tein domains, it is more robust compared to the assembly of split- 
protein fragments and readily reversible. Moreover, because of the 
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anti- target recognition 
handle sequence 
44444 _ Input module: ss Output module: 
~ ssDNA template ~  enzyme-inhibitor pair 
handle handle 


Fig. 1 Design of a modular DNA-directed biosensor based on intramolecular TEM1 -B-lactamase—inhibitor com- 
plex formation. To implement full modularity, a multi-component strategy is used where the input and output 
modules are noncovalently assembled. Therefore, the enzyme and inhibitor proteins are conjugated to oligonucle- 
otide handles that are complementary to anti-handle sequences appended to the target recognition sequence, 
allowing the assembly of the enzyme-inhibitor pair in a ternary complex (“OFF” state). Hybridization of a comple- 
mentary input oligonucleotide to the target recognition sequence results in the formation of a rigid double-helix, 
hereby mechanically disrupting the enzyme-inhibitor pair, i.e., the enzyme is switched on (“ON” state) 


2 Materials 


2.1 Protein 
Expression 
and Purification 


catalytic conversion of substrate to product, the enzyme-based 
DNA detection is more sensitive than direct optical readout of 
conformational switching sensors such as molecular beacons [14]. 
Here, we provide detailed protocols for the expression, purifica- 
tion, and bioconjugation of the TEM1-f-lactamase- and BLIP- 
oligonucleotide conjugates, as well as their assembly into an easily 
tunable system for oligonucleotide detection in molecular diagnos- 
tics and the readout of DNA-based molecular circuits. While the 
DNA-directed protein switch described in this chapter is originally 
designed to respond to ssDNA, the range of input molecules could 
be extended to small molecules and proteins by intelligent redesign 
of the input module using conformation-switching aptamers. Since 
plasmids for expression of both protein components will be made 
accessible via AddGene, we hope that these protocols will enable 
the TEM1-f-lactamase/BLIP system to be widely used. In addi- 
tion, both the design principles and the bioconjugation methods 
described here for the TEM1-f-lactamase/BLIP system should be 
generally applicable to other enzyme-inhibitor pairs, provided that 
they interact with a similar affinity. 


1. Plasmid pET29a_TEM1-B-lactamaseE104D_CtermC (encod- 
ing periplasmic leader sequence, His-tag, Thrombin cleav- 
age site, TEM1-f-lactamase(E104D), cysteine, Strep-tag) 
(see Notes 1 and 2). 
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2.2 Maleimide 
Functionalization 
of Amine-Modified 
Oligonucleotides 


2.3 Conjugation 
of Maleimide- 
Functionalized 
Oligonucleotides 
to Proteins 


2. 


17. 
18. 


2. 


Plasmid pET29a_BLIP_NtermC (encoding periplasmic leader 
sequence, His-tag, Thrombin cleavage site, cysteine, GGSHG 
linker, B-lactamase inhibitor protein (BLIP)) (see Notes 1 and 2). 


. Competent Escherichia coli BL21 (DE3) cells. 
. LB medium: 10 g peptone, 5 g yeast extract and 10 g NaCl in 


1 LdH,0O supplemented with 30 mg/L kanamycin. 


. 100 mM isopropyl B-p-1-thiogalactopyranoside (IPTG) stock 


solution. Filtered to sterilize (0.2 pm). 


. Shaking incubator. 
. High-speed centrifuge. 
. Osmotic shock solution: 30 mM Tris-HCl, 1 mM EDTA, 20% 


sucrose at pH 8.0. 


. Low salt solution: 5 mM MgSQ,. 

. 10x Tris-HCl buffer: 200 mM Tris-HCl at pH 7.4. 

. His-tag binding resin. 

. Charge buffer: 50 mM NiSO,. 

. Bind buffer: 20 mM Tris-HCl, 500 mM NaCl, 30 mM 


imidazole at pH 7.9. 


. Wash buffer: 20 mM Tris-HCl, 500 mM NaCl, 60 mM imidazole 


at pH 7.9. 


. Elution buffer: 20 mM Tris-HCl, 500 mM NaCl, 400 mM 


imidazole, 2 mM TCEP at pH 7.9. 


. Storage buffer: 20 mM Tris-HCl, 150 mM NaCl, 2 mM 


TCEP at pH 7.9. 
Amicon centrifugal filter unit (NMWL = 10 kDa). 
Nanodrop UV-VIS spectrophotometer. 


. Oligonucleotides modified with a primary amine at their 3’- or 


5’-terminus (e.g., Integrated DNA Technologies). 
Dimethylsulfoxide (DMSO). 


3. Ethanol (cooled to —30 °C). 
4. Phosphate buffered saline (PBS): 100 mM sodium phosphate, 


150 mM NaCl at pH 7.2. 


. 5 M NaCl. 


. Sulfosuccinimidyl 4-(N-maleimidomethyl)cyclohexane- 1- 


carboxylate (Sulfo-SMCC, no-weigh format). 


1. Ligation buffer: 100 mM sodium phosphate at pH 7.0. 


. Desalting column: PD-10 (e.g., GE Healthcare, Cat No: 


17-0851-01). 


. Strong anion-exchange spin column, mini (e.g., Thermo 


Scientific, Cat No: PI90008). 


2.4 Semi-Native 
SDS-PAGE 

of Reconstituted 
DNA-Directed Enzyme- 
Inhibitor Complex 


2.5 Enzyme Activity 
Assay of Reconstituted 
DNA-Directed Enzyme- 
Inhibitor Complex 


2.6 Enzyme Activity 
Assay at Different 
Input Oligonucleotide 
Concentrations 
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4. Buffer A: 20 mM Tris-HCl, 50 mM NaCl at pH 8.0. 


iS) 


N DO ov f& w 


. Buffer B: 20 mM Tris-HCl, 1 M NaCl at pH 8.0. 


. Complexation buffer: 50 mM sodium phosphate, 100 mM 


NaCl at pH 7.0. 


. 2x sample buffer: 125 mM Tris-HCl (pH 6.8), 4% (w/v) SDS, 


20% (v/v) glycerol, 0.01% bromophenol blue. 


. 30% (w/v) acrylamide/bisacrylamide solution (29:1, w/w). 

. N,N,N’, N’-tetramethylethylenediamine (TEMED). 

. 10% (w/v) ammonium persulfate solution in water. 

. Isopropanol. 

. Running buffer: 187 mM glycine, 19 mM Tris-HCl, 3.5 mM 


SDS. 


. Protein molecular weight marker, prestained. 


. Coomassie Brilliant Blue. 


. PBS*: 50 mM sodium phosphate, 100 mM NaCl, 1 mg/mL 


bovine serum albumin (BSA) at pH 7.0. 


. 50 pM ssDNA template stock solution dissolved in H,O 


(Table 1). 


. 50 pM ssDNA target stock solution dissolved in H,O 


(Table 1). 


. 500 pM nitrocefin, chromogenic B-lactamase substrate (e.g., 


EMD Millipore, VWR, Cat No: 80,017—707) dissolved in 
PBS. 


. Multiwell plate reader (with absorbance mode). 


. Transparent 96-well plates. 


. PBS*: 50 mM sodium phosphate, 100 mM NaCl, 1 mg/mL 


BSA at pH 7.0. 


. 50 pM ssDNA template stock solution dissolved in H,O 


(Table 1). 


. 50 pM ssDNA target stock solution dissolved in HO (Table 1). 
. 20 pM CCE2-FA, fluorescent fB-lactamase substrate (e.g., 


Invitrogen, Cat No: K1034) dissolved in PBS. 


. Multiwell plate reader (with fluorescence mode). 


. Black 96-well plate. 
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To facilitate hybridization-induced assembly of the enzyme-inhibitor 
pair the oligonucleotide handles have to be conjugated to the indi- 
vidual protein domains. In particular, cysteine is an attractive target 
for protein conjugation due to its low prevalence in natural proteins 
and ease of insertion using site-directed mutagenesis. In contrast 
to non-natural amino acids, which allow fully orthogonal protein 
conjugation chemistries, the incorporation of a nonnative cysteine 
typically does not result in a significant decrease in expression yield 
(~0.3 mg/L culture for both TEM1-f-lactamase and BLIP). 
To allow straightforward protein expression while ensuring site- 
specific conjugation of the oligonucleotide handles to the proteins 
(yielding homogeneous ODN-protein conjugates), thiol-maleimide 
coupling was used targeting a genetically inserted cysteine on the 
proteins. To install the maleimide moiety on the oligonucleotide 
handles, the heterobifunctional crosslinker Sulfo-SMCC, consist- 
ing of an NHS-ester, cyclohexyl spacer, and maleimide, was cou- 
pled to a 5’- or 3’-amine-modified oligonucleotide for BLIP and 
TEM1-f-lactamase, respectively. Subsequent ethanol precipitation 
of the maleimide-functionalized oligonucleotides allowed the 
removal of non-reacted Sulfo-SMCC. Finally, the oligonucleotides 
were conjugated to the cysteine inserted in the proteins (Fig. 2). 


sulfo-SMCC 
° 
IH 


DIO 


Fig. 2 (a) Crystal structure of a complex between TEM1-B-lactamase and BLIP (PDB: 3C7V). (b) ODN-protein 
conjugation strategy. Amine-modified oligonucleotides are reacted with the NHS-ester of the heterobifunc- 
tional crosslinker Sulfosuccinimidy! 4-(N-maleimidomethyl)cyclohexane-1-carbocylate (Sulfo-SMCC) install- 
ing a thiol-reactive maleimide moiety on the oligonucleotides. After the removal of the excess crosslinker by 
ethanol precipitation, the maleimide-activated oligonucleotides are reacted with a cysteine in the proteins. 
Adapted with permission from ref. 13. Copyright 2015 American Chemical Society 
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Conversion yields (typically 50-90%) can be estimated by SDS- 
PAGE analysis. Subsequent Ni**-affinity chromatography and 
anion-exchange chromatography to remove excess oligonucle- 
otides and unconjugated proteins, respectively, resulted in pure 
ODN-protein conjugates. 

Formation of the intermolecular ternary complex is performed 
by mixing ODNI-TEM1-f-lactamase, ssDNA template, and 
ODN2-BLIP in a 1:1.2:2 molar ratio. Using a slight excess of 
ssDNA template and ODN2-BLIP ensures that no free ODN1- 
TEM1-f-lactamase or ODN1-TEM1-f-lactamase-template 
remains, both of which would contribute to background activity. 
In order to confirm the formation of homogeneous ternary com- 
plexes upon hybridization of the individual components, the mix- 
tures were analyzed by semi-native PAGE. Quantitative conversion 
to the ternary complex was observed upon the addition of the 
ssDNA template to a mixture of the ODN-enzyme and ODN- 
inhibitor conjugates. A gel-shift to higher molecular weight after 
the addition of a complementary input oligonucleotide indicates 
the formation of the desired quaternary complex. Whereas semi- 
native PAGE suggests successful formation of the ternary DNA- 
directed enzyme-inhibitor complex, enzyme-activity assays were 
performed to test whether the enzyme-inhibitor interaction is 
indeed induced upon hybridization of both ODN-protein conju- 
gates to the ssDNA template strand. To this end, the complexes 
(containing templates with target recognition sequences of 0 to 50 
nucleotides) were diluted to a final concentration of 1 nM. After 
the addition of the colorimetric substrate nitrocefin, substrate con- 
version was monitored by measuring the optical absorbance of the 
formed product as a function of time. Whereas a strong inhibition 
of enzyme activity was observed with all target recognition 
sequence lengths, inhibition efficiencies increased as the target rec- 
ognition sequence length increases. This observation suggests that 
a certain degree of flexibility between the protein partners is pref- 
erable to allow more efficient intramolecular complex formation. 
In contrast, full recovery of enzyme activity was only achieved 
upon the addition of input sequences of >40 nucleotides, suggest- 
ing that a double-helix of at least 40 nucleotides is required to 
overcome flexibility in the protein termini and SMCC-linker and 
spatially separate the interacting protein partners. 

Making use of an enzyme as output domain provides the 
advantage over conventional fluorescent domains that substrate 
molecules are catalytically converted, hence the output signal accu- 
mulates. This allows the use of much lower concentrations of the 
sensor complex, hereby increasing the sensitivity of the input oli- 
gonucleotide detection. When using 100 pM of the DNA-specific 
biosensor together with the more sensitive fluorescent substrate 
CCE2-FA, as little as 10 pM of input oligonucleotide results in a 
distinguishable increase in enzyme activity. 


3.1 Expression 
and Purification 
of Recombinant 
Proteins 
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. Transform competent E. cols BL21 (DE3) cells separately with 


the pET29a_TEM1-f-lactamaseE104D and pET29a_BLIP 
plasmids (see Note 3). 


. For each construct, inoculate 20 mL of LB medium supple- 


mented with 30 mg/L kanamycin with a single colony of 
transformed bacteria and incubate overnight at 37 °C in a 
shaking incubator at 250 rpm. 


. Transfer the seed culture to 2 L of LB medium supplemented 


with 30 mg/L kanamycin. 


. Grow the bacteria in a shaking incubator at 200 rpm at 


37 °C. At an optical density at X = 600 nm of 0.6, add 2 mL of 
IPTG stock yielding a final concentration of 100 pM IPTG 
and incubate at 16 °C for 20 h. 


. Harvest the cells by centrifugation at 10,000 x g for 10 min 


at 4 °C. 


. Resuspend the pelleted cells in 300 mL osmotic shock solution 


and incubate for 10 min at room temperature. 


7. Pellet the cells by centrifugation at 12,000 x g for 20 min. 


10. 


11. 


12. 


13. 


14. 


. To extract the periplasmic fraction, resuspend the pelleted 


cells in 300 mL ice-cold low salt solution and incubate for 
10 min on ice prior to centrifugation at 40,000 x g for 40 min 
at 4 °C. 


. Add 10x Tris-HCl buffer to the supernatant (i.e., the periplasmic 


fraction) to yield a final concentration of 20 mM Tris-HCl. 


Charge a 2 mL His-bind column (gravity flow) with Ni** ions 
by washing the column with 10 column volumes of charge 


buffer and subsequently equilibrate with 10 column volumes 
bind buffer. 


Load the periplasmic fraction on the His-bind column and 
allow it to enter the column bed completely. Wash the column 
with 10 column volumes of wash buffer and subsequently elute 
the protein of interest with 10 column volumes of elution 
buffer. 


Concentrate the eluted proteins to a final volume of ~2.5 mL 
using the centrifugal filter unit. 


Exchange the buffer to storage buffer by gel filtration using a 
PD-10 desalting column and determine the concentration by 
measuring the optical absorption at 4 = 280 nm with the 
Nanodrop UV-VIS spectrophotometer and calculated extinction 
coefficients (see Note 4). 


Flash-freeze the purified proteins in liquid nitrogen and store 
at —80 °C. 
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3.2 Maleimide 
Functionalization 
of Amine-Modified 
Oligonucleotides 


3.3 Conjugation 
of Maleimide- 
Functionalized 
Oligonucleotides 
to Proteins 


1. 


2. 


Dissolve the amine-modified oligonucleotides in PBS to a 
concentration of ~1 mM. 


Dissolve 2 mg of Sulfo-SMCC (1 vial) in 300 pL DMSO to 
yield a stock concentration of ~20 mM. 


. Mix an equal volume of Sulfo-SMCC with the amine-modified 


oligonucleotides yielding a 20-fold molar excess of Sulfo- 
SMCC in 1:1 PBS:DMSO. 


. Incubate the mixture at room temperature under continuous 


shaking at 850 rpm for 2 h. 


. To precipitate the oligonucleotides, add 10% (v/v) 5 M NaCl 


and 300% (v/v) ethanol precooled to —30 °C, and store the 
mixture at —30 °C for at least 1 h. 


. Pellet the precipitated oligonucleotides by centrifugation at 


19,000 xg rpm for 15 min at 4 °C. 


. Dissolve the oligonucleotides in PBS to a concentration of 


~1 mM and repeat steps 5 and 6 of the ethanol precipitation 
procedure twice (see Note 5). 


. Finally, dry the pelleted oligonucleotides to air overnight at 


room temperature. 


. Immediately prior to oligonucleotide conjugation, desalt 


the proteins to ligation buffer to remove TCEP by gel filtra- 
tion using a PD-10 desalting column and dilute to ~50 pM 
(see Note 6). 


. Add the appropriate volume of protein solution to the dried 


oligonucleotide to yield a threefold molar excess of maleimide- 
functionalized oligonucleotide. 


. Incubate the mixture at room temperature under continuous 


shaking at 850 rpm for 2 h. 


. To remove excess oligonucleotides load the reaction mixture 


ona 1 mL His-bind column charged with Ni’* ions and equili- 
brated with bind buffer. Wash with 10 column volumes of 
bind buffer. 


. Elute the oligonucleotide-protein (ODN-protein) conjugates 


in 500 pL fractions using 10 column volumes of elution 
buffer. 


. Analyze the reaction efficiency and purity of the elution fractions 


by 12% SDS-PAGE. 


. Combine all elution fractions that contain ODN-protein 


conjugates. 


. Exchange the buffer to buffer A by gel filtration using a PD-10 


desalting column. 


10. 


11. 


12. 


13. 


14. 


15. 
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. Prewash a strong anion-exchange spin column by applying 


400 pL buffer A and spinning at 2000 x g for 5 min and dis- 
carding the flow-through. 


Repeatedly add 400 pL of desalted ODN-protein conjugates 
and centrifuge for 5 min at 2000 x g until the entire sample is 
loaded on the anion-exchange spin column. Discard the 
flow-through. 


Remove non-conjugated protein by adding 400 pL buffer A and 
centrifuging for 5 min at 2000 x g. Discard the flow-through. 
Repeat this step ten times. 


Elute the purified ODN-protein conjugate by adding 200 pL 
buffer B and centrifuging for 5 min at 2000 x gy. Repeat this 
step ten times. 


Analyze the elution fractions by 12% SDS-PAGE and combine 
all fractions containing the purified ODN-protein conjugates 
(Fig. 3). 

Quantify the concentrations of the ODN-protein conjugates 
by measuring the absorbance at 4 = 260 nm and using extinction 
coefficients of the oligonucleotides. 


Aliquot the purified ODN-protein conjugates in 20 pL fractions 
and flash-freeze in liquid nitrogen. Store the ODN-protein 
conjugates at —80 °C. 


buffer A wash buffer B elution 
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Fig. 3 SDS-PAGE analysis of the conjugation and anion exchange purification of a maleimide-functionalized 
oligonucleotide to a cysteine in (a) TEM1-B-lactamase and (b) BLIP on a 12% SDS-PAGE gel stained with 
Coomassie blue. (rxn: reaction mixture, buffer A wash: first five wash fractions with buffer A (low ionic strength) 
from the anion-exchange spin column, buffer B elution: first eight elution fractions with buffer B (high ionic 
strength) from the anion-exchange spin column 
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3.4 Semi-native 
SDS-PAGE 

of Reconstituted 
DNA-Directed Enzyme- 
Inhibitor Complex 


3.5 Enzyme Activity 
Assay of Reconstituted 
DNA-Directed Enzyme- 
Inhibitor Complex 


1. 


1. 


Prepare a 0.75-mm-thick 12% resolving gel by mixing 4 mL 
30% (w/v) acrylamide, 2.5 mL 1.5 M Tris-HCl pH 8.9, 
3.3 mL dH,O, 100 pL 10% (w/v) SDS, 100 pL 10% (w/v) 
ammonium persulfate and 6 pL TEMED. Cast the gel up to 
2 cm below the top and cover with a layer of isopropanol. 
Allow the gel to polymerize for 30 min. 


. Prepare a stacking gel by mixing 830 pL 30% (w/v) acryl- 


amide, 630 pL 1 M Tris-HCl pH 6.8, 3.4 mL dH,O, 50 pL 
10% (w/v) SDS, 50 pL 10% (w/v) ammonium persulfate, and 
3 pL TEMED. Pour the mixture on the top of the resolving 
gel and insert the comb. Allow the gel to polymerize for 
30 min. 


. To ensure efficient complex formation and inhibition of the 


enzyme in the assembled complex, sequentially mix ODN1- 
TEM1-f-lactamase, ssDNA template (with a target recogni- 
tion sequence length of 0, 10, 20, 30, 40, or 50 nucleotides) 
and ODN2-BLIP in a 1:1.2:2 ratio at low micromolar concen- 
trations in complexation buffer and incubate for 30 minutes at 
room temperature. 


. Mix the hybridized samples prepared in step 2 with an equal 


volume of 2x sample buffer and load it on a semi-native 12% 
SDS-PAGE gel prepared in step 1. Run the gel for 15 min at 
90 V followed by 1 h at 125 V. 


. Remove the gel from the glass plates and wash with dH,O for 


5 min. Subsequently stain the gel with Coomassie blue for 
30 min. Then destain the gel with dH,O until clear bands 
appear and image the gel (Fig. 4). 


Mix ODNI-TEM1-f-lactamase, ODN2-BLIP, and ssDNA 
template (with target recognition sequences of 0, 10, 20, 30, 
40, and 50 nucleotides) in PBS* at low micromolar concentra- 
tions in a ratio of 1:2:1.2 and incubate for 30 min at room 
temperature. 


. Dilute the mixtures in PBS* to a final concentration of 50 nM 


ODNI1-TEM1-B-lactamase. 


. In a 96-well plate, mix 4 pL of the diluted complexes with 


152 pL PBS* and 24 pL target oligonucleotides (from 100 nM 
stock solution in PBS*) and incubate for 1 h at room 
temperature. 


. Add 20 pL nitrocefin (from 500 pM stock solution in PBS) 


yielding a final concentration of 1 nM DNA-directed enzyme- 
inhibitor complex, 12 nM target oligonucleotide, and 50 pM 
nitrocefin. 


. Directly place the plate in the platereader and proceed with 


measuring the optical absorbance in time of the formed product 
at 486 nm. 
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ODN1-TEM1-B-LacE104D s+ 
ODN2-BLIP 
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Fig. 4 Semi-native 12% SDS-PAGE analysis of the reconstitution of the DNA-directed enzyme-inhibitor pair. 
Low micromolar concentrations of the synthetic protein switch components were sequentially mixed at a ratio 
of 1:1.2:2 of ODN1-TEM1-f-lactamaseE104D, ssDNA template, and ODN2-BLIP. Lanes 6, 8-72 show efficient 
formation of the ternary complex containing different lengths of target recognition sequence. A gel-shift to 
higher molecular weight is observed upon the addition of 10 equivalents of target oligonucleotide (30 nucleo- 
tides), indicating efficient binding of the input oligonucleotide to the synthetic protein switch. Adapted with 
permission from ref. 13. Copyright 2015 American Chemical Society 


3.6 Enzyme Activity 1. 


Assay at Sub- 
nanomolar Input 
Oligonucleotide 


Concentrations 2. 


. Derive the hydrolysis rate by calculating the slope of the absor- 


bance in time of the initial, linear regime. Correct for background 
hydrolysis of nitrocefin in the absence of enzyme (Fig. 5). 


Mix ODNI1-TEM1-f-lactamase, ODN2-BLIP, and ssDNA 
template (with a target recognition sequence length of 40 nucle- 
otides) in PBS* at low micromolar concentrations in a ratio of 
1:2:1.2 and incubate for 30 min at room temperature. 


Dilute the mixture in PBS* to a final concentration of 5 nM 
ODNI1-TEM1-B-lactamase. 


. In a 96-well plate, mix 4 pL of the diluted complex with 


152 pL PBS* and 24 pL target oligonucleotide (concentrations 
ranging from 100 pM to 20 nM) and incubate for 1 h at room 
temperature. 


. Add 20 pL CCF2-FA (20 uM in PBS), yielding a final concen- 


tration of 100 pM DNA-directed enzyme-inhibitor complex, 
10 pM to 2 nM target oligonucleotide, and 2 uM CCF2-FA. 
Incubate for 45 min at room temperature. 
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4 Notes 


(J Target 0 
() Target 10 


Gl ODN1-B-lacE104D 


GEM ODN1-f-lacE104D + ODN2-BLIP 


Hydrolysis rate 


0 10 20 30 40 50 
Target recognition sequence length (bases) 


Fig. 5 Influence of target recognition sequence length and target oligonucleotide 
lengths on enzyme inhibition and enzyme activation, respectively. ODN1-TEM1- 
6-lactamaseE104D, template oligonucleotide (with various target recognition 
sequence lengths), and ODN2-BLIP were incubated in a 1:1.2:2 ratio at low 
micromolar concentrations. After subsequently diluting the mixtures to 1 nM 
concentration of ODN1-TEM1-f-lactamaseE104D in the presence of 12 nM tar- 
get oligonucleotides of various lengths, the colorimetric substrate Nitrocefin was 
added and substrate conversion was measured. Adapted with permission from 
ref. 13. Copyright 2015 American Chemical Society 


5. 


Place the plate in the platereader and monitor the formation of 
the fluorescent product at 409 nm excitation and 447 nm 
emission. 


. Derive the hydrolysis rate by calculating the slope of the 


fluorescence in time within the linear regime (Fig. 6). 


. Previous work has shown that conjugation efficiencies greatly 


depend on the steric availability of the inserted cysteine. In 
case of low conversion efficiencies insert a flexible GGSGG 
linker between the protein and cysteine [10]. 


. Plasmids encoding for TEM1-B-lactamase and BLIP will be 


made available via AddGene. 


. Based on previous work using the intramolecular complex for- 


mation between TEM1-f-lactamase and BLIP as output mod- 
ule for genetically encoded protein switches, we here use the 
E104D mutant of TEM1-f-lactamase (K, = 1500 nM) as the 
wild-type proteins bind too strongly (K; = 0.5 nM), which might 
result in intermolecular complex formation and oligomerization 
of the DNA-directed protein switch. 


. TCEP is added to the storage buffer to reduce intermolecular 


disulfide bonds that render the genetically inserted cysteine 
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Fig. 6 Enzyme activity as a function of target oligonucleotide concentration. 
ODN1-TEM1-f-lactamaseE104D, template oligonucleotide (with a target recog- 
nition sequence of 40 nucleotides), and ODN2-BLIP were incubated at low 
micromolar concentrations to form the inhibited ternary complex. Subsequently, 
the complex was diluted to 100 pM of ODN1-TEM1-f-lactamaseE104D in the 
presence of various concentrations of target oligonucleotide. After the addition of 
the fluorescent substrate CCF2-FA product formation was measured by measur- 
ing the increase in fluorescence in time (A, = 409 nm, Aen = 447 nm). A linear 
increase in enzyme activity as a function of target concentration is observed up 
to 1 equivalent of the template oligonucleotide, suggesting tight binding between 
the ternary complex and target oligonucleotide. Adapted with permission from 
ref. 13. Copyright 2015 American Chemical Society 


nonreactive toward maleimides. TCEP can also reduce native, 
intramolecular disulfide-bonds. However, the BLIP and 
TEM1-f-lactamase proteins did not show any (irreversible) 
reduction of intramolecular disulfides. 


5. To ensure efficient conjugation of maleimide-functionalized 
oligonucleotides to the proteins of interest, it is crucial to per- 
form at least three rounds of ethanol precipitation after react- 
ing the amine-modified oligonucleotides with Sulfo-SMCC to 
ensure that all excess Sulfo-SMCC is removed since this will 
compete for conjugation to the protein. A single round of eth- 
anol precipitation has shown to significantly reduce final con- 
jugation efficiencies. 


6. To avoid the formation of nonreactive disulfide-linked protein 
dimers directly proceed with mixing the desalted proteins (1.e., 
after removing TCEP) with the maleimide-functionalized oli- 
gonucleotides. Various reports have published mixed results 
with TCEP interfering with maleimide conjugation. In our 
experience, TCEP significantly lowers conversion yields and is 
therefore removed prior to the reaction with the maleimide- 
functionalized oligonucleotide. 
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Part V 


Proteolytic Sensors 


Chapter 13 


Engineering and Characterizing Synthetic Protease 
Sensors and Switches 


Viktor Stein and Kirill Alexandrov 


Abstract 


Proteases are finding an increasing number of applications as molecular tools and reporters in biotechnology 
and basic research. Proteases are also increasingly incorporated into synthetic genetic signaling circuits 
equipping cells with tailored new functions. In the majority of cases however, proteases are employed in 
constitutively active forms which limits their utility and application as molecular sensors. The following 
chapter provides a detailed experimental protocol for converting constitutively active proteases into regulated 
protease receptors. Such receptors can potentially sense, transduce, and amplify any molecular input, 
thereby opening up a range of new applications in basic research, biotechnology, and synthetic biology. 


Key words Protein switches, Protein engineering, Proteases, Diagnostic reagents 


1. Introduction 


1.1 Proteases Proteases constitute one of the most abundant classes of enzymes 
as Versatile Tools that execute key physiological functions across all kingdoms of 
in Biotechnology life [1]. Proteases have evolved to function inside as well as outside 
and Basic Research cellular environments taking on diverse roles such as regulating cell 


death, controlling blood coagulation, digesting nutrients, and 
remodeling the extracellular environment. Chemically, proteases 
catalyze the irreversible cleavage of a peptide bond. Their substrate 
specificity ranges from peptide motives as short as two amino acids, 
to complex tertiary interactions that are mediated by structurally 
distinct protein domains. Given the central importance of prote- 
ases to many physiological processes and the irreversible nature of 
the proteolytic cleavage, protease function is frequently tightly 
regulated. One common mechanism relies on the expression of 
inactive zymogens that require additional posttranslational pro- 
cessing by an activating protease that cleaves off an active site- 
directed inhibitor or triggers complex conformational 
rearrangements allowing a protease to transition into a catalytically 
active conformation. 


Viktor Stein (ed.), Synthetic Protein Switches: Methods and Protocols, Methods in Molecular Biology, vol. 1596, 
DOI 10.1007/978-1-4939-6940-1_13, © Springer Science+Business Media LLC 2017 
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1.2 Choice 
of Proteases 
as Elementary Signal 
Transducers 


Proteases are also increasingly applied in biotechnology as 
therapeutic and diagnostic reagents, or in basic research as molecu- 
lar reporters and tools in the production of recombinant proteins. 
In addition, proteases are finding increasing application in synthetic 
biology as components of synthetic signaling circuits that equip 
cells with new tailor-engineered functions [2, 3]. The key advan- 
tage here is the fact that, by virtue of cleaving a peptide bond, they 
can be readily interfaced with any protein-associated biological 
function. Notably, in comparison to alternative posttranslational 
protein modifications such as protein phosphorylation, proteolysis 
constitutes a drastic conformational change as one polypeptide is 
split into two with the two resultant polypeptides transitioning 
from an intra- to an intermolecular state. This transition is associ- 
ated with a significant change in free energy that can be readily 
exploited to engineer protease-inducible actuators: e.g. to activate 
protein function by cleaving off an autoinhibitory (AI) domain, or 
to inhibit protein function by cleaving functionally important resi- 
dues. Consequently, a capacity to engineer synthetic protease 
receptors in a systematic fashion constitutes a powerful tool to 
engineer biological signaling transduction systems. The following 
protocol chapter gives a detailed guide how to engineer and char- 
acterize synthetic protease switches with custom response func- 
tions focusing on four elementary types of signaling sensing and 
transducing mechanisms: This includes autoinhibited protease 
transducers, allosteric protease receptors, integrated sensing and 
amplification circuits, as well as proximity-dependent protease 
sensors (Fig. 1). 


Elementary protease transducers, allosteric protease receptors, and 
proximity-dependent protease sensors are engineered by recom- 
bining naturally occurring proteases with binding receptors and 
AlI-domains that bind and block the active site of the protease and 
thus shut down its activity in the basal state. Individual compo- 
nents can either be derived from natural sources or be artificially 
engineered. Given the modular organization of components, the 
construction of synthetic protease switches typically occurs in a 
step-wise fashion. This means, protease transducers, their cognate 
Al-domains, and binding receptors can be first engineered and 
characterized individually before they are assembled into func- 
tional switches, while empirically optimizing the structure and 
length of the linkers connecting individual domains. 

In our initial proof-of-concept studies, we chose established 
proteases based on members of the Nuclear Inclusion a (NIa) fam- 
ily. The most prominent members are Tobacco Etch Virus (TEV) 
and Tobacco Vein Mottling Virus (TVMV) protease that have 
found widespread applications as tools in molecular and cell 
biological research [4]. In addition, we chose the NS3 protease of 
Hepatitis C Virus (HCV) which has been extensively studied as a 
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Fig. 1 Schematic summary of different types of synthetic protease sensors and amplification circuits based on 
modularly organized autoinhibited protease modules. (a) An elementary autoinhibited protease transducer P1 
can be engineered by connecting a competitive autoinhibition domain that can be specifically cleaved off by 
an activating protease P2. (b) An allosterically regulated protease receptor P1 can be engineered by replacing 
the cleavage site for an activating protease with an allosteric binding receptor R1-R2 that undergoes a large 
conformational change upon binding its cognate ligand L. (c) To accelerate the response time and improve 
sensitivity, the signal generated by the primary allosterically regulated protease sensor P1 can be transduced 
to cleave and activate a secondary protease amplifier P2. (d) Alternatively, activation of the secondary amplifier 
can be based on the ligand L induced colocalization of a primary transducer P1 with a secondary amplifier P2 
to engineer a proximity-dependent protein-protein interaction sensor 


drug target, but is also increasingly adopted to develop tools for 
basic research. Notably, a range of peptide-based, active site directed 
inhibitors have been devised for HCV that can serve as Al-domains 
and readily be identified from the existing literature [5]. 

In general terms, proteases are preferably well characterized 
both biochemically and structurally. The availability of large pro- 
tein families is also advantageous as it allows to infer biochemical 
and structural information from phylogenetic data. Notably, for 
viral proteases whose native function is to process the viral polypro- 
tein, the putative cleavage site can be directly identified from the 
genomic sequence requiring little experimental profiling of pro- 
teolytic cleavage sites. The availability of biochemical and X-ray 
structural data also suggests that it is possible to produce and engi- 
neer a particular protease through the recombinant route in E. colt. 
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1.3 Choice 

of Binding Receptors 
as Molecular 
Recognition Elements 


1.4 General 
Considerations 

on the Construction 
of Al-Domains 


This greatly facilitates their subsequent purification, in vitro char- 
acterization, and engineering by means of rational design or high- 
throughput screening approaches. 

The second consideration concerns structural features of the 
protease: Ideally, the N- and C-termini of the protease are located 
in close proximity of the active site. In this way, an Al-domain is 
displayed at a high effective concentration such that it can effec- 
tively compete with the substrate and thus shut down the activity 
of the protease transducer in the basal state. Conversely, its activity 
can be strongly induced after the Al-domain is irreversibly cleaved 
off by an activating protease or reversibly dislodged from the active 
site through a binding event. The exact mechanism of activation 
ultimately depends on the molecular recognition elements in the 
linker connecting the AIl-domain with the protease transducer. 


Similar considerations concern the choice of ligand-binding recep- 
tors that serve as molecular recognition elements to detect the tar- 
get analyte. Binding receptors can either recognize two distinct 
epitopes with separate binding sites on the same target analyte or 
feature an allosteric receptor that undergoes a significant confor- 
mational change upon binding the target analyte. In case of the 
latter, binding receptors that undergo cooperative binding interac- 
tions upon associating with their target analyte are preferred as this 
overcomes potential problems with multivalent interactions and 
the subsequent formation of oligomeric complexes. For instance, 
we exploited a binding receptor based on artificially engineered 
affinity clamps consisting of a circularly permutated PDZ domain 
attached to an enhancer domain that recognizes the PDZ domain 
exclusively in its ligand bound form and thus forms a sandwich 
complex upon binding its cognate ligand [6, 7]. 

From a protein engineer’s perspective, it is highly desirable to 
recombine autoinhibited protease modules with distinct families of 
binding receptors that display similar structural and biophysical 
characteristics, yet diverse ligand specificities. In this way, the pro- 
tein engineering process is rendered as generally applicable as pos- 
sible. In particular, the orientation of the N- and C-termini is 
critical if a distinct class of binding receptors is to be repeatedly 
recombined with an autoinhibited protease module with little 
empirical optimization to yield synthetic protease sensors with 
diverse ligand specificities. Ideally, a distinct class of binding 
receptors can be sourced from naturally occurring receptor fami- 
lies. Alternatively, binding receptors can be engineered using stan- 
dardized work flows as for example recombinant antibody-like 
binders that can be derived using a variety of display technologies. 


Al-domains are engineered using a combination of rational, 
structure-guided protein engineering in combination with high- 
throughput screening. Notably, for protease-based signal transducers, 
the proteolytic cleavage product provides a good molecular lead to 


2 Materials 


2.1 General 
Reagents 
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engineer genetically encoded, active site-directed competitive inhibitors 
that can be directly fused to the protease transducer. Depending 
on their size, individual cleavage products can either be appended 
to the N- or C-terminus. Considering the N-terminal cleavage 
product of TVMV comprises five of six amino acids of the core 
substrate motif, this means the product-based competitive inhibitor 
needs to be appended to the C-terminus. In many cases, appending a 
product-based inhibitor is however insufficient to ensure a high 
enough level of competitive autoinhibition. As a result, its binding 
affinity needs to be improved using a combination of structure-guided 
protein engineering and high-throughput combinatorial screening, 
for instance, by introducing additional charge and shape complemen- 
tarity between the AlI-domain and the protease transducer or by 
restricting the orientation of the connecting linker. 

In addition, a key concern is that recombinant protein expres- 
sion leads to pre-mature termination of translation and therefore 
truncated protein products with no Al-domain. This may be fur- 
ther aggravated by limited nonspecific proteolysis of exposed 
and/or flexible linkers during protein expression and/or subse- 
quent cell lysis. This is particularly problematic for modularly 
organized protease switches that rely on C-terminal Al-domains 
as this limits the extent to which a synthetic protein switch is 
autoinhibited in the basal state. For instance, if 10% of a purified 
protein is truncated with no Al-domain, its maximum activation 
is limited to tenfold. For C-terminally positioned Al-domains, it 
is therefore paramount to purify full-length protein through an 
affinity tag that is located C-terminal of the AIl-domain. This also 
requires bridging the P1-P1’ junction through a dipeptide motif 
that binds, but at the same time cannot be cleaved. This is how- 
ever not trivial for proteases. Bridging the P1-P1’ junction 
through an uncleavable dipeptide motif also allows to place the 
Al-domain in the N-terminal position and engineer additional 
affinity features beyond the core cleavage product to enhance 
binding. An N-terminal Al-domain also facilitates the detection 
of weakly enhancing mutations, especially if their contribution 
toward affinity is incremental and thus difficult to resolve if the 
induction is limited by prematurely truncated protein expression 
products in cell lysates. Consequently, it is recommended to first 
engineer dipeptide motifs that can bridge the P1-P1’ junction 
before improving the affinity of Al-domain for its cognate protease 
receptor. 


1. 100 mg/mL carbenicillin (1000x stock). 
2. 50 mg/mL kanamycin (1000x stock). 
3. 34 mg/mL chloramphenicol (1000x stock). 
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2.2 Construction 
of DNA Libraries 
with USER Enzyme 


2.3 High-Throughput 
Expression 

of Synthetic Protease 
Receptors 


4. 


1. 


ne WwW bd 


1. 


2. 


LB medium: 10 g peptone, 5 g yeast extract, and 10 g NaCl in 
1 LdH,O. 


. LB agar plates: 10 g peptone, 5 g yeast extract, 10 g NaCl, and 


15 g agar in 1 LdH,O. 


Chemically competent Top10 E. col cells (or equivalent) for 
DNA cloning. 


. Pfu Cy DNA polymerase (Agilent). 

. 10x Pfu C, reaction buffer. 

. 1O mM dNTPs each. 

. Primer-For: 5’-GCTGAAGTCTTACGAGGAAGAGTTGGC-3’ 


(Tu = 70.5 °C). 


. Primer-C-Ter: 5’-ACGGTUTCGCGACCTACACCG-3’ 


(Tee 721°C), 


. Primer-Library: 5’/-AACCGUGCGCTTTNNNNNNGGAA 


GCACCCACCACCATCAT-3/ (Ty = 70.3 °C). 


. Primer-Rev.: 5’-CGTTGTAAAACGACGGCCAGTG-3’ (Ty 


= 70.3 °C). 


. Vector for the expression of recombinant NIa proteases with 


maltose binding protein (MBP): e.g., pRK/793 based on 
pMAL-C2. 


. 10x T4 DNA ligase buffer (NEB). 

. 400 U/pL T4 DNA ligase (NEB). 

. 1 U/pL USER Enzyme Mix (NEB). 

. Restriction enzymes NcoI and BamHI to digest the pRK793 


vector backbone. 


. Gibson Assembly Kit. 
. Wizard SV Gel and PCR Clean-Up System. 
. Miniprep plasmid DNA purification system. 


Chemically competent BL21(DE3)-RIL E. coli cells (or equiv- 
alent) for protein expression. 

Plasmid 05665 featuring the bacteriophage 4 SRRz autolysis 
cassette [8, 9] under the control of the tetracycline inducible 
promoter. 


. 96-deep-well-plates to grow E. coli in high throughput. 


4. 50x 5052 medium: 25% glycerol (w/v), 2.5% glucose (w/v) 


and 10% a-lactose (w/v) [10]. 


. 20x NPS: 0.5 M (NH:z)2SO,, 1M KH,PO, and 1M Na,HPO, 


pH 6.75. 
1 M MgSO,. 


2.4 Expression 1. 


and Purification 
of TVMV and HCV 2 
Protease Switches 
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. 1000x trace metal solution: 50 mM FeCl;, 20 mM CaCl, 10 mM 


MnCl, 10 mM ZnSO,, 2 mM CoCh, 2 mM CuCl, 2 mM 
NiClL,, 2 mM Na,MoQ,, 2 mM Na,SeO;3, 2 mM H3BO3. 


. P-5052 autoinduction medium: 1x 5052 medium, 1x NPS, 


1x trace metal and 1 mM MgSO, diluted in ddH,O. 


. 200 pg/mL tetracycline dissolved in ethanol (1000x stock). 
10. 


P-5052 autolysis medium: 1x 5052 medium, 1x NPS and 
1 mM MgSO, diluted in MQ supplemented with 200 ng/mL 
tetracycline. 


Terrific broth (TB): 1.2% (w/v) tryptone, 2.4% (wt/vol) yeast 
extract, 0.04% glycerol, 0.17 M KH»POx,, and 0.72 M K,HPOsg. 


. TB-based autoinduction medium: TB supplemented with 0.2% 


lactose, 0.05% glucose, and 2 mM MgCl. 


. Phosphate buffered saline (PBS): 10 mM Na,HPO,, 2 mM 


KH,PO,, 137 mM NaCl, and 2.7 mM KCl, pH 7.4. 


. Washing and binding buffer: 20 mM Na,HPO, and 20 mM 


imidazole pH 8.0 supplemented with 300 mM NaCl for 
TVMV"™-AI protease transducers, 500 mM NaCl for 
HCV'™Y-AT protease transducers, and 1 M NaCl for TVMV- 
FEN3-PDZ-AI allosteric receptors. 


. Elution buffer: 20 mM Na,HPO, and 500 mM imidazole 


pH 8.0 supplemented with 300 mM NaCl for TVMV™"-AI 
protease transducers, 500 mM NaCl for HCV'YY-AT, and 
1 M for TVMV-EN3-PDZ-AI allosteric receptors. 


. Protein storage buffer: 50 mM Tris-HCl and 10% (v/v) 


glycerol, pH 8.0 supplemented with 1 mM EDTA and 2 mM 
DTT for TVMV""-AI protease transducers, 1 M NaCl, 1 mM 
EDTA, and 2 mM DTT for TVMV-FN3-PDZ-AI allosteric 
receptors, and 500 mM NaCl and 2 mM 6-mercaptoethanol 
for HCV'YMY-AT protease transducers. 


7. 0.25-um nitrocellulose filters. 


2.5 Assay Synthetic 1. 


Protease Receptors 


. Ni-NTA columns (e.g., 5 mL HisTrap FF Crude from GE 


Healthcare). 


. Centrifugal filters with 10-kDa cutoff. 
. PD-10 desalting columns. 


Protease Assay Buffer: 50 mM Tris-HCl, 100 mM NaCl, 
50 pg/mL BSA and 2 mM DTT, pH 8.0 (see Note 1). 


. Summary of protease reagents and their working concentra- 


tions is given in Table 1. 


. Summary of protease peptide substrate, ligand peptides, and 


AI domains used to characterize individual protease receptors 
is provided in Table 2. 
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Table 1 
Summary of working concentration of individual protease switches in different assays and 
applications 


Concentration Concentration Type of measurement or 
Protease switch (final assay) (4x Stock) Target analyte application 
TVMV™-AI 500 nM 2 uM Thrombin Maximum induction ratio 
HCV'"™-AI 500 nM 2 uM TVMV Maximum induction ratio 
TVMV-FN3-PDZ-AI 250 nM 1 uM Ligand Bl Maximum induction Ratio 
TVMV-FEN3-PDZ -AI 250 nM 1 uM Ligand B2 Maximum induction ratio 
TVMV-EN3-PDZ -AI 100 nM 400 nM Ligand Bl Ky determination 
TVMV-FN3-PDZ -AI 10 nM 40 nM Ligand B2 Ky determination 
SH3?°"-TVMV-EN3- 10 nM 40 nM Ligand Bl Integrated signal sensing 
PDZ-AI and B2 and amplification circuit 
SH3??-HCV1YMV-AT 200 nM 800 nM Ligand Bl Integrated signal sensing 
and B2 and amplification circuit 
FKBP12-TVMV"™-AI 100 pM 400 pM Rapamycin Proximity sensor 
HCV'""-AI-FRB 200 nM 800 nM Rapamycin Proximity sensor 


Table 2 
Summary of protease substrates, affinity clamp ligands, and Al domains 


Protease Substrate Sequence Comments 

TVMV ANA-GETVRFQSDT(164)-NH, Used in library screening and protease activity assays 

HCV ANA-DDVTPCSMS(164)-NH, — Used in library screening and protease activity assays 

TVMV-DD ANA-GETVRFQSDT(164) Used for measuring enzyme kinetics. Contains 

DD-NH, additional Asp residues for improved water 
solubility 

Bl Ligand NH,)-RGSIDTWV-COOH Kp = 0.6 nM for the native affinity clamp ePDZ-b1 
(“Strong”) 

B2 Ligand NH,)-PQPVDSWV-COOH Kp = 5 nM for the native affinity clamp ePDZ-b1 
(“Weak”) 


AI Domain NH>-EYVRFAPGST-COOH TVMV specific AI domain 


ANA refers to a 5-amino-2-nitrobenzoyl group 
164 refers to a Mimotopes specific code for lysine coupled to a 7-methoxycoumarinyl-4-acetyl group 


4. 8mM TVMV peptide substrate dissolved in DMSO and stored 
at —80 °C. 


5. 20 uM TVMV peptide substrate diluted in protease assay buffer 
and stored on ice. 


2.6 Equipment 


3 Method 


3.1 Cloning 
Autoinhibited 
Proteases Modules 
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6. 8 mM HCV peptide substrate dissolved in DMSO and stored 
at —80 °C. 

7. 20 uM HCV peptide substrate diluted in protease assay buffer 
and stored on ice. 


8. 8 mM TVMV-DD peptide substrate dissolved in DMSO and 
stored at —80 °C. 


9. 800 uM TVMV-DD peptide substrate diluted in protease assay 
buffer and stored on ice (see Note 2). 


10. 1 U/pL thrombin dissolved in 50 mM Tris-HCl pH 8.0 and 
stored at —80 °C. 


11. 1 mM Ligand B1 dissolved in 50 mM Tris-HCl pH 8.0 and 
stored at —80 °C. 


12. 1 mM Ligand B2 dissolved in 50 mM Tris-HCl pH 8.0 and 
stored at —80 °C. 


13. 100 mM AI domain NH,-EYVRFAPGST-COOH as a synthetic 
peptide dissolved in DMSO and stored at —80 °C. 


14. 200 pg/mL rapamycin stock solution dissolved in ethanol and 
stored at —80 °C. 


15. Black 96-well plates for fluorescence measurements. 


16. Optical adhesive film (to cover black 96-well plates for fluores- 
cence spectroscopy). 


. PCR thermocycler. 

. Agarose gel electrophoresis equipment. 

. SDS-PAGE equipment. 

. Fluorescence multiwell plate reader (e.g., Biotek Synergy 4). 
. Shaking incubator to grow E. coli in liquid medium. 


. Static incubator to grow E. coli on agar plates. 


NDA wo ® W WN FE 


. One shot cell disruptor for E. coli cell lysis. 


Viral protease receptors are expressed as fusion proteins with 
maltose-binding protein (MBP) based on pRK793 [11]. This con- 
struct has previously been developed to facilitate the expression of 
the NIa protease from Tobacco Etch Virus (TEV) where MBP acts 
as a molecular chaperone to enable more efficient recombinant 
expression of NIa proteases in E. coli[12, 13]. To construct focused 
libraries, a variety of DNA library generation and DNA assembly 
techniques are available. For instance, combinatorial DNA libraries 
with degenerate codons coding for different amino acids are suitable 
to bridge the P1-P1’ junction or improve the affinity between the 
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AlI-domain for its cognate protease transducer. In contrast, trun- 
cated DNA libraries are typically applied to optimize the length 
and structure of the linkers connecting functional domains. An 
exemplary protocol to engineer dipeptide motifs that can bind 
across the P1-P1’ junction without being cleaved is exemplified 
based on USER Enzyme-dependent DNA assembly [14]. 


1. Devise a cloning strategy for a focused DNA library to engi- 
neer a competitively autoinhibited protease module taking into 
account structural and biochemical data from the existing lit- 
erature (Fig. 2). For instance, in case of TVMV, the design of 
uncleavable P1-P1’ junctions for the Al-domain is based on 
substrate mapping data for the related TEV protease which 
suggests that proline in the Pl’ position prevents cleavage 
[15]. It is however unclear to what extent sequences with a 
proline in the P1’ position cannot bind or can bind, but cannot 
be cleaved by TEV protease. Similarly, it is not clear to what 
extent this holds true for other members of the NIa potyvirus 
protease family including the NIa protease of TVMV [2], but 
this needs to be screened and tested experimentally. 


2. Commercially synthesize the gene coding for the autoinhib- 
ited protease module of interest (Fig. 2). For TVMV, include a 
cleavage site for autoproteolytic processing followed by TVMV 
protease, a linker including a thrombin cleavage site, its cog- 
nate substrate sequence (which serves as a lead structure for 
engineering competitive Al-domains), and an affinity purifica- 
tion tag and insert into pRK793 via NcoI and BamHI. 


3. Generate a DNA fragment coding for the N-terminal portion 
by means of PCR using Pfu C, DNA polymerase, 1x Pfu C, 
reaction buffer, 10 mM dNTPs each, 1 ng template DNA cod- 
ing for the elementary protease module (step 2), and primers 
Primer-For and Primer-C-Ter. 


4. Generate a DNA fragment coding for the C-terminal portion 
by means of PCR using Pfu C, DNA polymerase, 1x Pfu C, 
reaction buffer, 10 mM dNTPs each, 1 ng template DNA coding 


> 
Fig. 2 Summary of cloning procedure to engineer autoinhibited protease modules exemplified with the Nla 
protease of TVMV. (a) First commercially gene synthesize an autoinhibited protease module that serves as a 
template for subsequent engineering attempts. This includes restriction sites Ncol and BamHI flanked by 50 bp 
homology regions to enable integration of the synthetic DNA fragment into a target vector, in this case pRK793, 
by means of Gibson Assembly. The autoinhibited protease module contains an N-terminal cleavage site for 
autocatalytic processing from MBP and a C-terminal cleavage site as the lead for a product-based competitive 
inhibitor. The latter is additionally separated by a cleavage site for thrombin and an affinity tag to enable puri- 
fication of the full-length protease transducer. (b, c) Using suitable primer pairs amplify DNA fragments coding 
for the N- and C-terminal portion of TVMV while randomizing the P1-P1’ junction using degenerate codons that 
will ultimately yield a dipeptide motif that can bind across the P1-P1’ junction, yet cannot be cleaved. 
Recombination of the two DNA fragments is mediated through USER enzyme-dependent ligation. (d) Summary 
of the USER enzyme-dependent recombination site including the coding amino acid sequence 
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(a) Synthetic DNA Fragment Coding for Elementary Autoinhibited Protease Module 


gctgaagtcttacgaggaagagttggcgaaagatccacgtattgcecgcca Siemmepaaaacgceccagaaa 
ggtgaaatcatgccgaacatccctcagatgtccgctttctggtatgcecgtgcgtactgcggtgatcaacg 
ccgccagcggtcgtcagactgtcgatgaagccctgaaagacgcgcagactaattcgatcacaagtttgta 
Caaaaaagcaggctcggaaaccgtgcgtttccagtctggtggttctgggtcttctaaagctttgctgaag 


ggcgtgcgcgattttaatccgatctctgcttgcgtatgcctgctggaaaactcctcggatggtcatagtg 


agtctggaagcacccaccaccatcatcatcactgaRfpbmest ctagagtcgacctgcaggcaagcettgge 


actggccgtcgttttacaacg 


Sequence Annotation 


- Restriction sites Ncol and BamHI, and USER Enzyme recombination sites arehighlighted in 
- 50bp homologous recombination sites with vector backbone are underlined 

- TVMV cleavage sites are double-underlined 

- Thrombin cleavage site is wave-underlined 

- His affinity purification tag is thick-underlined 

-  P1-P1’ junction is highlighted in grey 


(b) Primer Pair N-terminal TVMV Fragment 


5’ -GCTGAAGTCTTACGAGGAAGAGTTGGC- 3’ Tu. 705. °C 


5’ -BIccctTjrcGcGACCTACACCG-3’ Tu: 72.1 °C 


(c) Primer Pair C-terminal TVMVFragmentwith Randomized P1-P1’ Junction 


5’ -MacccfjccGCTTTNNNNNNGGAAGCACCCACCACCATCAT-3’ Tue 70.3 ~C 


5’ -CGTTGTAAAACGACGGCCAGTG-3’ Tus 70.3. 'C 


(d) USER Enzyme Dependent Recombination Site with Randomized P1-P1’ Junction 
G V G R E T V R F Ge G S$ T H H H FH 


5’ -BacccfjccccTTTNNNNNNGGAAGCACCCACCACCATCAT-3’ 
3’ -GCCACATCCAGCGCTJTGGCEN- 5’ 
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for the elementary protease module (step 2), and primers 
Primer-Library and Primer-Rev. 


5. Mix equimolar amounts of the N- and C-terminal coding 
DNA fragments generated in steps 3 and 4 together with 1 U 
USER Enzyme per pg of total DNA and 200 U T4 DNA ligase 
per pg DNA in 1x T4 DNA ligase buffer. Incubate for 60 min 
at37 °C, 


6. Purify the assembly reaction using a suitable PCR purification kit. 


7. Check the quality of USER-mediated enzyme DNA assembly 
reaction by means of agarose gel electrophoresis. Typically, 
one major band corresponding to the desired DNA assembly 
product along with the two minor bands of the DNA assembly 
substrates is visible. 


8. Optional: Purity the desired DNA fragment by means of aga- 
rose gel electrophoresis (see Note 3). 


9. Perform a Gibson Assembly reaction to recombine the DNA 
insert library into a suitable expression vector as per manufac- 
turer’s instructions. We typically use the vector pRK793 back- 
bone based on the pMAL series cut with NcoI and BamHI 
(see Note 4). 


10. Transform 5 pL of the Gibson Assembly mix into 50 pL chem- 
ically competent £. coli cells and plate on LB agar plates sup- 
plemented with 100 pg/mL carbenicillin. 


11. To check the quality of the library, inoculate single colonies in 
5 mL LB medium supplemented with 100 pg/mL carbenicil- 
lin, purify plasmid DNA and sequence. 


3.2 Engineering Once a DNA library has been generated, it needs to be screened 
Autoinhibited Protease — experimentally to identify mutants with the desired properties. The 
Modules: High- following generic protocol can be used to screen the function of 


Throughput Screening autoinhibited protease modules, sensors, and switches aiming to 
optimize the binding strength of the Al-domain, or the length and 
structure of the connecting linkers. 


1. Transform the library of autoinhibited TVMV mutants into 
chemically competent BL21(DE3)-RIL cells hosting the 
autolysis plasmid 05665 (see Note 5). 


2. Plate transformed cells on LB agar plates supplemented with 
100 pg/mL carbenicillin, 50 pg/mL kanamycin, and 34 pg/mL 
chloramphenicol. 


3. Following overnight incubation at 37 °C, inoculate single col- 
onies into 96-deep-well plates filled with 1 mL minimal PA-5052 
autoinduction medium [10] supplemented with 100 pg/mL 
carbenicillin, 50 pg/mL kanamycin, and 34 wug/mL 
chloramphenicol. 


3.3 Purifying 
Synthetic Protease 
Switches 


10. 


11. 


12. 


13. 
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. Grow cells for 72 h at 30 °C at 320 rpm until an OD 609 of ~10 


is typically reached. Store 96-deep-well plates at 4 °C for sev- 
eral days to regrow plasmids in step 11. 


. Optional: Spin down 200 pL cell suspension and resuspend 


cells in fresh PA-5052 autoinduction medium, but with no 
trace metals (see Note 6). 


. Transfer 100 pL aliquots of E. coli cell cultures from half a 


96-well plate (48 samples) and dilute twofold in 100 pL autol- 
ysis medium and incubate for 90 min at 30 °C with agitation 
at 200 rpm. 


. Transfer 10 pL aliquots of cell lysates and duplicate into sepa- 


rate halves of a black 96-well plate filled with 140 pL protease 
assay buffer. 


. Supplement all of the duplicates with 1 pL of 1 U/pL throm- 


bin to cleave the linker connecting TVMV to its Al-domain 
and incubate for 30 min. 


. Initiate the reaction by adding 50 pL of 20 pM TVMV peptide 


substrate solution giving rise to 5 uM TVMV peptide substrate 
in the final reaction. 


Monitor the reaction progress in a 96-well-plate fluorescence 
reader by measuring the release of the quenched fluorophore 
7-methoxycoumarinyl-4-acetyl at 405 nm following excitation 
at 330 nm. 


Evaluate the induction of TVMV protease activities by com- 
paring the time-course of the reaction in the presence and 
absence of thrombin. 


Regrow E. coli from step 3 that code for protease transducer 
mutants that display a high induction ratio in step 9 in 5 mL 
LB medium supplemented with 100 pg/mL carbenicillin. 


Purify and sequence plasmid DNA that codes for the protease 
transducer mutants with the desired properties. 


To characterize engineered protease switches biochemically under 
defined reaction conditions, it is recommended to purify indi- 
vidual protease switches to homogeneity by means of affinity 
chromatography. 


1. 


Transform plasmids coding for the desired synthetic protease 
switch into chemically competent BL21(DE3)-RIL cells, plate 
onto LB agar plates supplemented with 100 pg/mL carbenicil- 
lin, and 34 pg/mL chloramphenicol and incubate overnight at 
a7 °C. 


. Inoculate 250 mL TB-based autoinduction medium supple- 


mented with 100 pg/mL carbenicillin and 34 pg/mL chlor- 
amphenicol with a single colony. 


210 Viktor Stein and Kirill Alexandrov 


3.4 Quantification 
of Synthetic Protease 
Switches 


3.4.1 Determine 

the Induction 

of Autoinhibited Protease 
Transducers and Receptors 


10. 


ll 


. Grow cells over the course of 48 h at 30 °C at 200 rpm express- 


ing the protein by means of autoinduction (see Note 7). Cells 
usually reach OD oo values of ~14. 


. Harvest cells by centrifugation at 4500 x g and wash once with 


500 mL PBS. At this stage, cells can either be stored at —20 °C 
or directly processed. 


. Resuspend cells in washing and binding buffer and lyse using a 


one-shot cell disruptor. 


. Centrifuge samples at 25,000 x g and pass the supernatant 


through a 0.25-pm nitrocellulose filter to remove any large- 
scale debris and genomic DNA that could clog up the affinity 
purification column in the subsequent step. 


. Load the lysate onto 5 mL HisTrap FF Crude Ni-NTA col- 


umns equilibrated with washing and binding buffer. 


. After loading the cell lysate allow the His,.-tagged protease 


switch to bind to the column and then wash the column with 
40 column volumes of washing and binding buffer. 


. Elute the His,-tagged protease switch with an imidazole gradient 


from 20 to 500 mM over 40 column volumes. Throughout the 
procedure maintain a flow rate of 5 mL/min. The His,-tagged 
protease switch typically elutes around 100 mM imidazole. 


Pool fractions that contain the His.-tagged protease switch, 
concentrate the pooled fractions using centrifugal filters with a 
10 kDa cutoff and transfer into storage buffer with a suitable 
concentration of NaCl by means of gel filtration using dispos- 
able PD-10 desalting columns (see Note 8). 


. Synthetic protease switches can either be assayed directly or 


stored in aliquots at —80 °C. Aliquoting samples and flash- 
freezing in liquid nitrogen is highly recommended to prevent 
deterioration of the protein sample over successive freeze-thaw 
cycles. 


Once purified, individual protease receptors can be assayed and 
characterized under defined reaction conditions: e.g., to assess 
their maximum induction ratios, determine the dissociation con- 
stants of allosteric protease receptors for their cognate ligands, 
measure the inhibition constants of Al-domain peptides, or assess 
the purity of protease transducers by means of kinetic analysis. The 
preferred assay reaction volume is 200 pL while individual assay 
components are preferably added in increments of 50 pL based on 
4x working stock solutions of individual reagents. 


1. 


To assess the maximum induction ratio of elementary autoin- 
hibited protease transducers, preincubate 10 1M of T[VMV"™-AI 
or 10 pM of HCV'YMY-AT for 10 min in 100 pL protease assay 
buffer in the presence and absence of an activating protease 
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target (ie., for TVMV"™-AI include 0.005 U/pL thrombin 
and for HCV'Y™Y-AT include 500 nM TVMV). Typical reac- 
tion volumes are 50-100 pL supplemented with 1-2 pL of 
1 U/mL thrombin. 


. Mix 100 ul protease assay buffer with 50 pL of 2 pM of 
TVMV™-AI or 2 pM of HCV'YMY-AT as pretreated + throm- 
bin or TVMV from step 1. Generally perform dilutions in pro- 
tease assay buffer. 


. Alternatively, to assess the maximum induction ratio of alloste- 
ric protease receptors upon binding their cognate ligands, pre- 
incubate 50 pL of 1 pM TVMV-FN3-PDZ-AI together with 
50 pL of protease assay buffer and 50 pL of 4 uM ligand B1 or 
40 uM ligand B2 (see Note 9). 


. Alternatively, to assess the Kp value of allosteric protease recep- 
tors for their cognate ligands, preincubate 50 pL of 40 nM of 
TVMV-EN3-PDZ-AI with 50 pL protease assay buffer and 
50 pL of varying concentrations of either ligand B1 or ligand 
B2. To reliably determine the Ky, the highest concentration of 
ligand B1 or ligand B2 in the final reaction should be at least 
100 times the anticipated Kp (see Note 9). 


. Depending on the type of protease switch, initiate the reaction 
by adding 50 pL of either TVMV or HCV protease peptide 
substrate giving rise to a final concentration of 5 pM peptide 
substrate in a final reaction volume of 200 pL. Generally 
include a no enzyme control in 200 pL protease assay buffer to 
account for excitation-dependent bleaching of the fluorophore 
in the protease peptide substrate. 


. Monitor the reaction progress using a fluorescence multiwell plate 
reader by monitoring the release of 7-methoxycoumarinyl-4-acetyl 
from the quenched protease peptide substrate with Kex/em of 330 
and 405 nm. Synthetic protease switches based on TVMV or 
HCV at a concentration of 100 nM or more generally yield a good 
fluorescence signal over the course of 90 min. In contrast, syn- 
thetic protease switches assayed at a concentration of 10 nM or 
less are preferably resolved over the course of 900 min. To pre- 
vent excessive bleaching of the fluorophore, no more than 45 
time points should be measured under each condition. For 
prolonged periods of time, it is also recommended to cover the 
96-well plate with optical adhesive film to prevent evaporation 
of the reaction assay. 


. When assessing maximum induction ratios, extract initial rates 
from the change in fluorescence (after subtracting the back- 
ground fluorescence with no enzyme). Then calculate the 
induction ratio of individual TVMV- and HCV-based protease 
switches in the inactive and active states by dividing the initial 
rate in the presence and absence of the target analyte. 
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8. When performing ligand titrations, extract initial rates from the 
change in fluorescence (after subtracting the background fluo- 
rescence with no enzyme) and plot against the ligand concentra- 
tion. Then determine the apparent Kps of allosteric protease 
receptors by a nonlinear regression fit of the curve to Eq. 1. 


([Sensor] +[Ligand]+ K,, ) 


| ([Sensor] +[Ligand]+K,, y — 4.x|Sensor] [Ligand ]) 


3.4.2 Kinetic 
Characterization 

of Synthetic Protease 
Switches 


Ve 


2x [Sensor] 


1. Analysis by SDS-PAGE frequently cannot resolve prematurely 
truncated or partially cleaved synthetic protease transducers 
considering the small size of 10-15 amino acids that the AI- 
domain and the short His, affinity purification tag comprise at 
the C-terminus of both the TVMV- and HCV-based transduc- 
ers. In addition, any cleavage and truncation products will in 
the majority of cases be heterogeneous and not concentrate in 
a single band, but spread out, rendering their detection even 
more difficult. Instead, the quality of a purified protein prep 
can be assessed indirectly by determining the kinetic parame- 
ters of purified protease switches in their autoinhibited and 
activated states. Notably, any residual activity associated with 
partially autoinhibited protease switches should display elevated 
Ky compared to in its inactive state. 


2. Preincubate 10 pM of TVMV"""-AI in the presence and absence 


of 0.05 U/pL thrombin for 1 hin protease assay buffer. Typical 
reaction volumes are 50 to 100 pL supplemented with 1-2 pL 
of 1 U/L thrombin. 


3. Set up a dilution range of varying concentrations of the prote- 
ase substrate TVMV-DD in 100 pL protease assay buffer in 
black 96-well plates. For the reliable, quantitative determina- 
tion of Ky; values, set up duplicates for every substrate concen- 
tration. The highest substrate concentration in the final assay 
should be at least three times the anticipated Ky, for the prote- 
ase substrate. For instance, in case of fully active TVMV with 
an anticipated Ky of 80 1M, set up substrate concentrations 
from 5 to 400 pM based on 100 pL of a 2x TVMV-DD work- 
ing stock solution. 


4. In addition, prepare an equivalent dilution range of the prote- 


ase substrate TVMV-DD in 200 uL protease assay for a non- 
enzyme control to account for excitation-dependent bleaching 
of the fluorophore. 


5. Initiate the reaction by adding a suitable amount of TVMV- 
based transducer in a total volume of 100 pL. The optimal 
amount of TVMV will depend on the activity state of individ- 


3.4.3 Assessing 
the Strength of Al-Domain 
Binding 
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ual TVMV switches, as assessed in Subheading 3.4.1. For 
instance, for fully active TVMV switches, 100 nM is suitable 
while for largely predominantly autoinhibited TVMV switches, 
10 uM is required to reliably measure the initial rate. 


. Monitor the reaction progress using a fluorescence 96-well- 


plate reader by monitoring the release of 
7-methoxycoumarinyl-4-acetyl from the quenched TVMV 
peptide substrate with A... of 330 and 405 nm. Monitoring 
the reaction progress over the course of 90 min is usually suf- 
ficient to reliably extract data for quantitative analysis. 


. Extract initial rates from the change in fluorescence (after sub- 


tracting the background fluorescence in the no enzyme control 
at each substrate concentration) and plot against the substrate 
concentration. 


. Calculate the kinetic parameters by nonlinear regression fit to 


Michaelis Menten kinetics, as depicted in Eq. 2. 


[Substrate | 


cae See calcio 
Vatos * [Substrate]+ K,, 


(2) 


. For autoinhibited TVMV protease transducers, the Ky should 


increase relative to the activated transducer. If not, this means 
the majority of residual activity associated with a particular 
protein prep is due to prematurely truncated or cleaved 
protein. 


. To estimate the strength of binding interaction ofan Al-domain 


to its cognate protease transducer, preincubate 50 pL of 2 WM 
uninhibited TVMV"?"! Y?16W-transducer mutant with 50 pL 
protease assay buffer and 50 pL varying concentrations of the 
Al-domain peptide NHj;-EYVRFAPGST-COOH. For the reli- 
able, quantitative determination of K, values, set up duplicates 
for every Al-domain peptide concentration. The optimal con- 
centration range needs to be determined empirically with the 
highest concentration of the AI-domain peptide at least three 
to four times higher than the anticipated K.. 


. Initiate the reaction by adding 50 pL of 20 pM TVMV-DD 


substrate solution (to a final assay concentration of 5 .M) in 
the presence of varying concentrations of the Al-domain pep- 
tide NH,-EYVRFAPGST-COOH. Monitoring the reaction 
progress over the course of 90 min is usually sufficient to reli- 
ably extract data for quantitative analysis. 


. Monitor the reaction progress using a fluorescence 96-well-plate 


reader by monitoring the release of 7-methoxycoumarinyl-4-acetyl 
from the quenched TVMV-DD peptide substrate with fx /em Of 
330 and 405 nm. 
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3.5 Assembling 
Integrated Signal 
Sensing 
and Amplification 
Circuits 


3.5.1 Integrated 
Signaling Sensing 
and Amplification Circuits 


4. Extract initial rates from the change in fluorescence (after sub- 
tracting the background fluorescence in the no enzyme control 
at each substrate concentration) and plot against the concen- 
tration of the Al-domain peptide. 


5. Determine the K;, by a nonlinear regression fit of the curve to 
Eq. 3 with the Ky; and the TVMV-DD substrate concentration 
set to 65 pM and 5 pM for TVMYV, respectively (see Note 10). 


r=V [ Substrate] 
= Max x 


[Substrate]+ K,, x [ + ee) 


1 


Signaling switches based on autoinhibited protease modules carry 
the advantage that they can be readily assembled into synthetic 
protease circuits where a primary sensor can be connected to cleave 
and activate a secondary amplifier based on an alternative autoin- 
hibited protease module. This opens the possibility to assemble 
individual protease sensors and transducers into integrated signal 
sensing and amplification circuits with accelerated response times 
and improved sensitivity for their target analyte. 


1. Incubate 50 pL of the 40 nM SH3?°"-TVMV-FN3-PDZ-AI 
allosteric receptor with 50 pL of 20 pM HCV peptide sub- 
strate (final assay concentration of 5 pM) and 50 pL of varying 
concentrations of either ligand B1 or ligand B2. The optimal 
concentration of the SH3?°"-TVMV-FEN3-PDZ-AI allosteric 
receptor needs to be determined empirically and is preferably 
chosen in the low nM range to limit excessive, nonspecific acti- 
vation of the secondary amplifier HCOV'YY-AI-SH3?*. 
Similarly, the strength of the SH3-dependent binding interac- 
tion can be optimized by employing SH3-binding peptides of 
different interaction strength [16]. The concentrations of 
ligand B1 and ligand B2 can be chosen as in Subheading 3.4.1. 


2. Initiate the reaction by adding 50 pL of 800 nM HCV!YMY- 
AI-SH3"? giving rise to a final concentration of 200 nM 
HCV'Y™Y-AI-SH3??. 


3. Monitor the progress of the reaction using a fluorescence 
96-well-plate reader by monitoring the release of 
7-methoxycoumarinyl-4-acetyl from the quenched HCV 
peptide substrate with A.x/em of 330 and 405 nm. 


4. For the quantitative analysis of Kp values, extract initial rates 
from the change in fluorescence (after subtracting the back- 
ground fluorescence in the no enzyme control at each substrate 
concentration) and plot against the concentration of ligand B1 
or ligand B2. Crucially, for cascaded systems, the initial rates 
need to be extracted from the exponential phase before all of 


3.5.2 Proximity Sensors 
for Protein Interactions 


4 Notes 


1. 
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the secondary amplifier HCV'Y’Y-AI-SH3?® has been cleaved 
and activated to accurately determine the strength of binding 
interactions. 


Optional: Preincubate 50 pL of 10 pM FKBP12-TVMV"™"-AI 
with | pL of 1 U/pL thrombin to cleave off the AI domain. 
For reduced signal-to-noise ratios, the AI domain can remain 
attached (see Note 11). 


. Incubate 50 pL of 400 pM FKBP12-TVMV""-AI together 


with 50 pL of 20 uM HCV substrate (giving rise to a final con- 
centration of the HCV substrate 5 4M) and 50 pL of varying 
concentrations of rapamycin. With an anticipated Ky in the 
sub-nM range for the FKBP12-rapamycin interaction, the pre- 
ferred concentration of the primary FKBP12-TVMV™-AI 
sensor in the final assay is 100 pM to prevent titrating the 
amount of FKBP12-TVMV™"-AI and not the strength of the 
FKBP12-rapamycin binding interaction. Similarly, to reliably 
determine its Ky for FKBP12, rapamycin is applied between 
0.01 and 10 nM. 


. Initiate the reaction with 50 pL of 800 nM HCV!'YMY-AT- 


FRB. The preferred concentration of the secondary amplifier 
HCV'Y™Y-AJ-FRB in the final assay is 200 nM, but can be 
optimized empirically depending on the signal-to-noise ratio 
and the desired response rate. 


. Monitor the reaction progress using a fluorescence 96-well- 


plate reader by detecting the release of 7-methoxycoumarinyl- 
4-acetyl from the quenched HCV peptide substrate with A. /em 
of 330 and 405 nm. 


. For the quantitative analysis, extract initial rates from the change 


in fluorescence (after subtracting the background fluorescence in 
the no enzyme control at each substrate concentration) and plot 
against the substrate concentration. Crucially, for cascaded sys- 
tems, the initial rates need to be extracted from the exponential 
phase before all of the secondary amplifier HCV'Y“Y-AI-FRB has 
been cleaved and activated to accurately determine the strength 
of FKBP12-rapamycin-binding interaction. 


. Depending on specific protease switch being assayed, the NaCl 


concentration in the protease assay buffer needs to be opti- 
mized and tailored toward each application. For instance, 
SH3”°"-TVMV-FN3-PDZ-AI allosteric receptors display 
higher induction ratios at 1 M NaCl compared to 100 mM 
NaCl. In contrast, the activity of HCV-based switches is 


216 


Viktor Stein and Kirill Alexandrov 


inhibited at higher NaCl concentrations and should be kept at 
100 mM NaCl or less. 


. For the reliable determination of enzyme kinetics, the protease 


substrate needs to be fully dissolved at high substrate concen- 
trations. Ifa protease substrate is not fully soluble at the desired 
substrate concentration and the substrate solution appears 
cloudy, its solubility can be improved by appending negatively 
charged residues. 


. In the construction of DNA libraries, it is recommended to 


purify all DNA fragments for the DNA assembly reaction by 
means of agarose gel electrophoresis. In our experience, the 
Wizard SV Gel and PCR Clean-Up System kit provides the 
highest quality of DNA fragments free of impurities that can 
potentially interfere with subsequent enzymatic or transforma- 
tion steps. If necessary, individual DNA fragments can be addi- 
tionally cleaned by means of ethanol precipitation. 


. Alternatively, suitable vector backbones can be prepared by 


means of PCR independent of any restriction site. Due to the 
large size of the pRK793 backbone and the difficulties associ- 
ated with PCR amplification and susceptibility to point muta- 
tions, preparing the vector backbone by restriction enzyme 
digest is preferred in this case. 


. Plasmid 05665 features a bacteriophage A derived SRRz autoly- 


sis cassette [9, 17] under the control of a tetracycline inducible 
promoter. This allows releasing the recombinantly expressed 
protease switches into the lysate under relatively mild condi- 
tions. The use of detergent-based cell lysis procedures is not 
recommended as it interferes with the protease assay. 


. The induction of autolysis by means of the SRRz autolysis cas- 


sette with tetracycline is most efficient in fresh PA-5052 auto- 
induction medium. In addition, the presence of glucose, which 
inhibits the expression of genes under the control of the LacO 
promoter, in fresh PA-5052 autoinduction medium is irrele- 
vant at this stage as the protease transducer is already expressed. 
Depending on the type of protease transducer, including trace 
metals may interfere with protease function and should there- 
fore be omitted. 


. Alternatively, proteins can also be expressed by means of 


Isopropyl B-D-1-thiogalactopyranoside (IPTG). In this case, 
cells are grown to an ODg6o0 of 0.6—-0.8 before being induced 
with 0.1-1 mM IPTG and expressed for 3-12 h at 30 °C. 
Optimal induction conditions need to be determined empiri- 
cally for each type of protease sensor. 


. Depending on the synthetic protease sensor, it is critical to 


include the correct amount of NaCl in the storage buffer. 
Notably, the allosteric protease receptors are not stable in the 
absence of NaCl, but form a precipitate. 
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9. To measure the maximum induction ratio, the binding interaction 
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Chapter 14 


Characterizing Dynamic Protein-Protein Interactions Using 
the Genetically Encoded Split Biosensor Assay Technique 
Split TEV 


Jan P. Wintgens, Moritz J. Rossner, and Michael C. Wehr 


Abstract 


Dynamic protein-protein interactions (PPIs) are fundamental building blocks of cellular signaling and 
monitoring their regulation promotes the understanding of signaling in health and disease. Genetically 
encoded split protein biosensor assays, such as the split TEV method, have proved to be highly valuable 
when studying regulated PPIs in living cells. Split TEV is based on the functional complementation of two 
previously inactive TEV protease fragments fused to interacting proteins and provides a robust, sensitive 
and flexible readout to monitor PPIs both at the membrane and in the cytosol. Thus, split TEV can be 
used to analyze interactomes of receptors, membrane-associated proteins, and cytosolic proteins. In par- 
ticular, split TEV is useful to assay activities of relevant drug targets, such as receptor tyrosine kinases and 
G protein-coupled receptors, in compound screens. As split TEV uses genetically encoded readouts, 
including standard reporters based on fluorescence and luminescence, the technique can also be combined 
with scalable molecular barcode reporter systems, allowing the integration into multiplexed high- 
throughput assay approaches. Split TEV can be used in standard heterologous cell lines and primary cell 
types, including neurons, either in a transient or stably integrated format. When using cell lines, the basic 
protocol takes 30-96 h to complete, depending on the complexity of the experimental question addressed. 


Key words Protein-protein interaction, Split TEV, Biosensor, Split biosensor assay, RTK, GPCR, 
Phosphorylation-dependent interactions, Dose-response assay, Compound profiling 


1 Introduction 


1.1 Genetically Studying protein-protein interactions (PPIs) is paramount to 
Encoded Biosensor understand cellular signaling and responses thereof. The majority 
Assays for Studying of PPIs are precisely regulated both in time and space, and mediate 
Protein-Protein distinct cellular activities, including differentiation, proliferation, 
Interactions apoptosis, and inflammation. In terms of signaling, cell surface 


receptors and highly interlinked intracellular proteins, or hubs, are 
of special interest, as these proteins integrate key signaling activities 
to regulate global cellular responses. For cytosolic proteins, pivotal 
associations with a strong implication on downstream signaling are 
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frequently found at sub-membranous localizations. Here, signal 
transduction activities are commonly initiated through the dynamic 
formation of protein complexes that are composed of multiple 
PPIs [1]. For cell surface receptors, receptor tyrosine kinases 
(RTKs) and G protein-coupled receptors (GPCRs) are intensively 
studied protein classes, as deregulated signaling caused by these 
receptors is implicated in various human diseases, such as cancer 
and neurodevelopmental diseases [2, 3]. RT'Ks and GPCRs repre- 
sent major drug targets, supporting their significant role in clinical 
research and drug discovery [4]. Both the associations between 
two proteins as well as the activities of RTKs and GPCRs can be 
reliably studied in split biosensor assays (SBA, also called protein 
complementation assays [5]). SBA are particularly suitable to mon- 
itor interactions of cell surface receptors and cytosolic proteins 
with a sub-membranous localization as SBA facilitate an analysis in 
the natural habitat of these proteins [6]. 

RTKs are single transmembrane receptors, with a total number 
of 58 encoded in the human genome [3]. RTKs respond, with few 
exceptions, to extracellular cues, and have an extra-cellular ligand- 
binding domain, a cytosolic tyrosine kinase domain, and a cytosolic 
tail, which contains interaction motifs that establish, once phos- 
phorylated through ligand-driven activation, docking sites for 
cytosolic adapter proteins. Upon ligand binding, RIKs can 
homodimerize or heterodimerize, leading to a conformational 
change that transmits the signal into cytosol, where the kinase 
domains phosphorylate target tyrosine residues in trans [7]. In 
turn, Src homology 2 (SH2) domain- and phosphotyrosine bind- 
ing (PTB) domain-containing adapters are recruited to phosphor- 
ylated docking sites, a process that can be robustly and precisely 
monitored by SBA [8-10]. An established interaction between an 
RTK and an adapter protein initiates downstream signaling activi- 
ties, as these adaptors link RTK activation to downstream signal 
transduction pathways, such as the MAP kinase or PI3K/AKT sig- 
naling cascades. 

GPCRs are the largest class of receptors encoded in the human 
genome, comprising more than 800 receptors in total [11]. Like 
RTKs, deregulated GPCR signaling is strongly implicated in vari- 
ous human diseases. Strikingly, 30-40% of all marketed drugs tar- 
get GPCRs, making them the largest class of druggable receptors 
supporting their prominent role in drug discovery [4, 12]. GPCRs 
are seven-transmembrane receptors with three intracellular and 
three extracellular loops of varying length, an extracellular 
N-terminus, and a cytoplasmic C-terminal tail, which can associate 
with effector proteins in an activity-dependent manner [12]. 
Heterotrimeric G proteins are the major binding partners for 
GPCRs and trigger defined pathway responses, such as cAMP- 
mediated or Calcium-dependent pathway activities. Prolonged 
activation of GPCR activation results in its desensitization, which 
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is induced by the regulated recruitment of G protein-coupled 
receptor kinases (GRKs) that phosphorylate residues on the C ter- 
minus of the GPCR and thus provide a platform for B-Arrestin 
binding [13]. The binding of f-arrestin sterically obstructs G pro- 
tein coupling and triggers the internalization of the GPCR. The 
regulated association between a given GPCR and f-arrestin2 can 
be exploited to reliably monitor GPCR activities and is regularly 
used by SBA and related proximity assays [14-17]. 

To assess dynamic interactions among proteins, various 
approaches for genetically encoded SBA are available, including, 
but not limited to, GFP and variants thereof [18-20], luciferases 
(firefly [21, 22], Renilla |23, 24], Gaussia [25], click beetle lucif- 
erases [17, 26, 27]), B-lactamase [28], B-galactosidase [29, 30], 
ubiquitin [8, 31, 32], and the tobacco etch virus (TEV) protease 
[33]. In addition to SBA, proximity assays were developed, such as 
bioluminescence resonance energy transfer (BRET) assays com- 
prising firefly luciferase and GFP [34, 35] and the Tango method 
that uses a full-length TEV protease fused to an interacting partner 
like B-Arrestin to monitor GPCR activities [16]. 

Here, we present a detailed protocol for the split TEV tech- 
nique, a genetically encoded SBA that can be flexibly applied to 
sensitively monitor various types of regulated PPIs in living cells 
[33] (and reviewed in [6]). The technique is based on the 
interaction-induced fragment complementation of the TEV pro- 
tease. Split TEV can be used to assess dynamic interactions at the 
membrane and in the cytosol, thus allowing various combina- 
tions among potentially interacting candidate proteins (Fig. 1). 
Notably, interactions in the nucleus cannot be monitored using 
the presented technique, as the method was specifically designed 
to analyze PPIs at the membrane and in the cytosol. The tech- 
nique integrates various reporter systems, including fluorescent 
and luminescent reporters, making it implementable to many 
research laboratories. As split TEV flexibly allows using readouts 
of choice, scalable transcriptional barcode reporters that are ame- 
nable to multiplexed high-throughput formats and next-genera- 
tion sequencing may also be used. Combining these technologies 
will enable assessing drug target activities and cellular response 
profiles in parallel, thereby opening up new avenues in drug dis- 
covery. Here, we centre on luciferase reporters, as these are widely 
used both in single interaction assays and in high-throughput 
applications. 

For split TEV assays, an optimized form of the TEV protease 
is dissected into an N-terminal (NTEV, amino acids 1-118) anda 
C-terminal fragment (CTEV, amino acids 119-221, with a Ser- > 
Pro substitution at residue 219 for enhanced assay stability). The 
S$219P mutation renders the TEV protease refractory to autoca- 
talysis [36]. In addition, the optimized form is truncated after 
amino acid 221 to remove the inhibitory C-terminal tail, which 
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Fig. 1 Types of protein-protein interactions that can be monitored by split TEV. The candidates in a split TEV 
assay can be membrane proteins, membrane-associated proteins, and soluble proteins in the cytosol. Proteins 
localized to these subcellular areas can be tested for regulated interactions in all combinations, which are (a) 
membrane | membrane, (b) membrane | membrane-associated, (c) membrane | cytosolic, (d) membrane- 
associated | membrane-associated, (e) membrane-associated | cytosolic, and (f) cytosolic | cytosolic 


1.2 Split TEV Assays 
for Membrane 

and Membrane- 
Associated Proteins 


blocks the active site of the TEV protease [37]. NTEV and CTEV 
fragments are fused to candidate proteins, preferably to the 
C-terminal end of the protein candidates. For NTEV, N-terminal 
fusions are also functional. Protein Interaction-induced reassembly 
of the NTEV and CTEV moieties leads to the reconstitution of 
TEV proteolytic activity, which activates TEV-specific reporters. 
These can be either of proteolytic or of transcriptional type, and 
they maybe fluorescent or luminescent-based. Notably, the 
luciferase-based transcriptional reporters proved to be most sensi- 
tive and robust as the readout is functionally uncoupled from the 
interaction event itself and background readings were commonly 
lower compared to other options. Transcriptional reporters based 
on luciferase exhibit another feature as they comprise three levels 
of signal amplification: In addition to the proteolytic cleavage and 
a transcriptional amplification step, enzyme-based (i.e., luciferase) 
reporters allow a third step of amplifying the initial signal resulting 
in a robust and sensitive readout. 


For monitoring dynamic PPIs at the membrane, a membrane pro- 
tein or membrane-associated protein candidate is fused to the 
NTEV fragment along with a TEV protease cleavage site (tevS, 
encoded by the amino acids ENLYFQ’G; the TEV protease cleaves 
between Q and G) and the artificial transcriptional co-activator 
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Fig. 2 Schematic representation of split TEV assays. (a) Split TEV assays for membrane and membrane- 
associated proteins. A candidate receptor is fused to the NTEV fragment, followed by the TEV protease cleav- 
age site (tevS) and the co-transcriptional co-activator GAL4-VP16 (GV). A candidate soluble protein is fused to 
the CTEV fragment. Upon ligand stimulation, the receptor and the soluble protein associate with each other, 
bringing the TEV fragments into close proximity. This results in the reconstitution of TEV proteolytic activity, 
which is indicated by the outer glow shown in b/ue at the NTEV and CTEV fragments. Regained TEV protease 
activity cleaves at the tevS to release GV, as indicated by the scissors. Liberated GV translocates to the nucleus 
to transcriptionally activate a firefly luciferase reporter gene (Fluc) through binding to upstream activating 
enhancer sequences (UAS). (b) Split TEV assays for soluble proteins. A candidate soluble protein is fused to 
NTEV, a second soluble candidate is fused to CTEV. A cytosolic TEV reporter composed of a central GV unit that 
is trapped and flanked by ERT2 domains, each fused via a tevS, is additionally present in the cell. Association 
between the two candidates causes the TEV proteolytic activity to be reconstituted. Regained TEV protease 
activity cleaves at both tevS (indicated by the scissors) to liberate GV, which travels into the nucleus to activate 
the firefly luciferase reporter 


GAL4-VP16 (GV), resulting in a NTEV-tevS-GV tag (Fig. 2a). 
As this hybrid tag contains a transcriptional co-activator, it is criti- 
cal that the candidate protein is a membrane or membrane-associ- 
ated protein, as a nuclear localization will readily lead to high 
background readings due to increased GV activity. The second 
candidate protein is fused to the CTEV fragment. This candidate 
may be a membrane, membrane-associated, a cytosolic protein, or 


224 Jan P. Wintgens et al. 


1.3 Split TEV Assays 
for Soluble Proteins 


1.4 Assay Controls 


even a protein that shuttles between the cytosol and the nucleus. 
PPI-induced reconstituted TEV protease activity cleaves off GV, 
which translocates into the nucleus and activates a firefly luciferase 
reporter gene, which is placed under the control of upstream acti- 
vating sequences that contain DNA sequences recognized by GV. 

Split TEV assays can be used to monitor ligand-dependent 
interactions of cell surface receptors, such as RITKs and GPCRs, 
thus allowing to assess their activity. For instance, the Neuregulin1- 
induced association between ERBB4, an RTK of the ERBB family, 
and the regulatory subunit alpha of the PI3K, PIK3R1, was deter- 
mined in a dose-response assay (Fig. 3a), as well as its inhibition by 
lapatinib (Fig. 3b). Sample constitutive interactions assayed using 
split TEV are shown for the membrane-associated protein KIBRA 
(Fig. 3c). For assessing GPCR activities, we have recently reported 
a detailed protocol on the split TEV-based GPCR/f-Arrestin2 
recruitment assay, which uses a truncated version of B-Arrestin2 
and a modified NTEV-tevS-GV tag for improved sensitivity [38]. 
We therefore refer the reader to this specified GPCR split TEV 
protocol. Notably, the split TEV method proved to be more sensi- 
tive than the full- TEV Tango approach when assaying selected 
GPCR activities [15]. 


Interactions between candidate proteins that stay strictly cyto- 
solic and do not shuttle into the nucleus may be monitored using 
the hybrid NTEV-tevS-GV tag (see Note 1). For candidate pro- 
teins that show a certain degree of nuclear localization, candi- 
dates are fused to the NTEV fragment only (Fig. 2b). The other 
candidate protein is fused to CTEV. When pursuing the sole 
NTEV/CTEV tagging strategy, PPI-induced TEV protease activ- 
ity leads to the activation of a cytosolically localized transcrip- 
tional TEV reporter (denoted GV-2ER), which contains a central 
GV unit that is trapped by two ERT2 domains, each fused to GV 
via a tevS. ERT2 domains are modified oestrogen receptor 
domains, that do not respond to endogenous oestrogen, but to 
4-hydroxytamoxifen. Liberated GV translocates into the nucleus 
and activates the firefly luciferase reporter. MST1 dimer forma- 
tion based on the C-terminal SARAH domain is shown as an 
example for soluble protein interactions (Fig. 3d). Dynamic 
interactions like AKT-induced and phosphorylation-regulated 
BAD /14-3-3 and rapamycin-regulated FKBP/FRB have also 
been described using split TEV [9, 33]. 


When studying PPIs using split TEV, tagged proteins of interest 
are commonly introduced into cells at rather high expression lev- 
els, either transiently by transfection or by stable integration. 
Therefore, assay components should be expressed at rather low 
levels, possibly close to those of endogenous counterparts. 
Further, each assay should be individually validated by specific 
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Fig. 3 Examples of constitutive and regulated split TEV assays. (a, b) Dose response split TEV assays for the 
ERBB4 receptor using (a) the agonist EGF-like domain (EGFld) and (b) the antagonist lapatinib. ERBB4-NTEV- 
tevS-GV and PIK3R1-CTEV fusions were transiently transfected into PC12 cells and treated for 20 h using the 
indicated compound concentrations. (a) ERBB4/PIK3R1 -transfected assay cells were stimulated with increas- 
ing concentrations of EGFld. The EC; value was calculated at 3.06 ng/ml EGFld. (b) ERBB4/PIK3R1 -transfected 
assay cells were stimulated with increasing concentrations of lapatinib, followed 1 h later by the addition of 
a constant stimulus of 10 ng/ml EGFld. The IC;, value was calculated at 0.42 ,M lapatinib (b). Six replicates 
per condition, error bars represent SEM. (c) Constitutive split TEV assays for the kinase MST1. MST1 dimer- 
izes through its C-terminally located SARAH domain (/ane 2). Note that MST1-AC, which contains residues 
1-432 only and lacks the SARAH domain, does not interact with MST1 full-length, and thus only background 
signals are produced. Likewise, the missense mutation L444P disrupts the MST1 dimer formation [39]. (d) 
MST1 is not suitable for the hybrid tag NTEV-tevS-GV. MST1-NTEV-tevS-GV is cleaved and produces high 
background readings (/ane 4). By contrast, MST1 interacting proteins MST2, SAV1 and RASSF1A (R1A) yield 
low background readings. Readings are compared to a strong PPI (KIBRA::KIBRA) (/ane 7) and a background 
control (KIBRA::RASSF6 (R6)) (lane 2). RLU relative luciferase units; six replicates per condition, error bars 
represent SD 


interaction controls. Therefore, it is strongly advisable to include 
positive controls (i.e., proteins that do interact with the candidate 
of interest) and negative controls (i.e., proteins that do not bind 
to the candidate of interest) for each assay designed. 
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1.5 Experimental 


The split TEV technique can been applied to study constitutive 


Setups for Constitutive and regulated interactions that are modulated by ligands, ago- 


and Regulated 
Interaction Split TEV 
Assays 


nists, antagonists, or a combination thereof. Experimental setups 
for constitutive split TEV assays can be comfortably completed 
within 30 h (Fig. 4a). Dynamic interaction split TEV assays using 
agonists or agonist/antagonist combinations typically require up 
to 4 days, depending on the individual experimental design (cell 
seeding parameters, starving conditions, stimulation paradigms, 
etc.) (Fig. 4b, c). The effects of compound actions, either stimula- 
tory or inhibitory, are optimally assessed in dose response assays, 
which allow the calculation of ECs9 and ICs9 values (concentra- 
tions of agonist/antagonist compounds at half-maximal stimula- 
tory/inhibitory response). 

In summary, the split TEV technique is a powerful and sensi- 
tive tool providing a robust readout to monitor dynamic PPIs 
within their natural context. The split TEV method is character- 
ized by various features as it enables (1) the detection of dynamic 
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Fig. 4 Suggested timelines for split TEV assays. Cells are plated on the day before (or alternatively in the morn- 
ing) and transfected with assay plasmids. (a) Constitutive assay. After transfection, cells are incubated for 20 
h before lysis. (b) Agonist assay: After transfection, plasmids are allowed to express for 20 h, followed by a 
medium change to starve the cells in low-serum media. The following day, an agonist is added for 6-20 h, 
depending on the setup of the assay. (c) Antagonist assay: Transfection and medium change are performed as 
in (b). Before an agonist is added, cells are treated with an antagonist for 1 h. The stimulation time is depen- 
dent on the assay setup 


2 Materials 


2.1 Plasmids 
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interactions between full-length candidate proteins in living cells 
or in lysates thereof, (2) the detection of protein interactions 
localized at the membrane, sub-membrane compartment, and the 
cytosol, (3) the detection of protein interactions in heterologous 
mammalian cells, primary cell types including primary neurons 
and astrocytes, (4) the detection of regulated RTK and GPCR 
activities induced by extracellular stimuli, (5) the analysis of regu- 
lated, ligand or compound concentration-dependent interactions 
in dose-response assays assessing actions of agonists and antago- 
nists, and (6) the setup as protein interaction readouts for high- 
throughput screening (HTS) purposes [9, 15, 33, 40-42]. 
Therefore, split TEV may also be adapted to industrial HTS set- 
ups to screen for RTK and GPCR activities, for example in drug 
discovery programs. 

Below, we describe experimental setups for split TEV assays for 
membrane, membrane-associate and cytosolic proteins using firefly 
luciferase as a final reporter gene as this readout is widely used 
across laboratories, and allows adaptation to HTS approaches. 


1. For membrane, membrane-associated and strictly cytosolically 
localized proteins: 


ORF(X)-NTEV-tevS-GV plasmid. The ORF of candidate X is 
fused to NTEV (amino acids 1-118), tevS (ENLYFQG), 
and GV (Fig. 2a). Use molecular cloning to generate can- 
didate X fusion vectors (see Note 2). For a detailed descrip- 
tion on GPCRs, see the protocol by Wehr et al. [40]. 


For cytosolic protein candidates that shuffle between cytosol 
and nucleus: 


ORF(X)-NTEV plasmid. The ORF of candidate X is fused to 
NTEV (amino acids 1-118) only. In addition, the pGV- 
2ER plasmid is required as cytosolic TEV reporter (Fig. 2b). 


2. ORF(Y)-CTEV. The ORF of candidate Y is fused to CTEV 
(amino acids 119-221, including a S219P mutation). Use 
molecular cloning to generate the fusion vector (see Note 2). 


3. Firefly luciferase reporter plasmid. Use an upstream activating 
enhancer sequences (UAS)-driven firefly luciferase (e.g., 
pGL4.31[luc2P/GAL4UAS/Hygro], Promega, catalog no. 
C9351) (see Note 3). 


4. Renilla luciferase control plasmid. Use a thymidine kinase 
(TK)-driven constitutive Renilla luciferase as internal control 
(pRL-TK, Promega, catalog no. E2241). 


5. A GFP or GFP-derivative expressing plasmid for optical trans- 
fection control (e.g., mVenus-N1, Addgene, Plasmid #54640). 
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22 Cells HEK293 (ATCC), PC12 cells (PC12 Tet-Off, Clontech), any 
other heterologous cell line, or primary cells that are amenable to 
efficient transfection and allow studying protein-protein interac- 
tions (see Note 4). 


2.3 Reagents Is 


2.4 Equipment 1. 


6. 


Medium, depending on the cell line chosen (e.g., for HEK293 
cells: DMEM 4.5 g glucose/| without L-glutamine, each with 
and without phenol red). See complete media formulations for 
HEK293 and PC12 cells below. 


. Fetal bovine serum (FBS), heat-inactivated, 500 ml (Life 


Technologies) 


. Horse serum (HS), heat-inactivated, 500 ml (Life Technologies) 


(only needed for PC12 cells). 


. GlutaMAX, 100x, 100 ml (Life Technologies). 

. Penicillin and Streptomycin, 100x, 100 ml (Life Technologies). 
. Opti-MEM (Life Technologies). 

. Poly-t-lysine, mol wt 70,000-150,000, diluted in a final con- 


centration of 0.02 mg/ml in H,O (Sigma). 


. Lipofectamine 2000 transfection reagent (Life Technologies). 
. Passive lysis buffer, 5x (Promega, E1941). 
. Dual-Luciferase® Reporter 1000 Assay System (Promega, 


E1980) (for an alternative using self-made substrates, see 
Note 14). 


96-well plates: white with flat bottom for luciferase-based 
assays (Falcon). 


. 96-well plates: clear with flat bottom for optical transfection 


control and fluorescence-based assays (Falcon). 


. 75 cm? flasks (Falcon) or 15 cm dish (Falcon) (for mainte- 


nance of cell lines). 


. Cell culture facility including flow hood and incubator set to 


5% CO, and 37 °C. 


. Luciferase reader (e.g., Mithras, Berthold Technologies or 


Envision, PerkinElmer). 


Fluorescence microscope (to check for transfection efficiency). 


2.5 Media Prepare for each cell line used the appropriate media for mainte- 
Formulation nance and assay conditions. 


1. 


2. 


HEK293 cells, assay medium: DMEM (4.5 g glucose/I, with- 
out L-glutamine and without phenol red), 500 ml, supple- 
mented with 0.5% FBS and 2 mM GlutaMAX. 


HEK293 cells, medium for maintenance: DMEM (4.5 g 
glucose/l, without L-glutamine, with phenol red), 500 ml, 
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supplemented with 10% FBS, 2 mM GlutaMAX, and 100 U/ 
ml of each penicillin and streptomycin. 


. PC12 Tet-Off cells, assay medium: DMEM (1 g glucose/1, 


without L-glutamine and without phenol red), 500 ml, supple- 
mented with 1% FBS, and 2 mM GlutaMAX. 


. PC12 Tet-Off cells, medium for maintenance: DMEM (1 g 


glucose/l, without L-glutamine, with phenol red), 500 ml, 
supplemented with 5% FBS and 10% HS, 2 mM GlutaMAX, 
and 100 U/ml of each penicillin and streptomycin. 


. Design appropriate ORF(X)-NTEV-tevS-GV and OREF(Y)- 


CTEV fusion expression plasmids (see guidelines in Subheading 
2.1, items 1 and 2, and Note 2). When testing interactions 
between candidates X and Y, it is strongly advisable to design 
appropriate interaction controls (see Note 5). 


. Clone and prepare sufficient amount of DNA for the assays, set 


the concentration for each plasmid to 100 ng/l (see Note 6). 


. Prepare 96-well plates for split TEV luciferase assays and fluo- 


rescence controls. 


(a) Coat a white/clear 96-well plate(s) with a 0.02 mg/ml 
poly-L lysine (PLL) solution for 30 min at room 
temperature. 


(b) Wash twice with double deionized water, then let plates 
air-dry. 
PLL-coated plates can be stored at 4 °C for up to 6 weeks 
without losing coating efficiency. This step is optional for 
HEK293 cells. 


. Seed 20,000 HEK cells resuspended in maintenance medium 


without antibiotics into each well of the coated 96-well plate. 
Place the 96-well plate(s) into the incubator. For PC12 Tet- 
Off cells, seed 40,000 cells per well (see Note 7). 


. Seed cells onto a clear 96-well plate for an optical control of 


transfection efficiency. 


. On the next day, prepare the plasmid mixes for each condition 


to be tested. Run 6 replicates per condition to obtain sufficient 
readings for statistical analyses. For each 96-well, use 15 ng per 
single DNA. Make a master mix for 6 + 1 replicates, which 
contains an extra amount needed to allow for pipetting errors 
(for example, when using 15 ng DNA/well, use 15 ng x 7 = 
105 ng in total). For the Remzlla luciferase plasmid, use the 
same DNA amount chosen. Prepare a constitutively expressing 
GFP plasmid (or variant thereof) (2 ng per well) for control 
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10. 


11. 


12. 


1:3; 
14. 


15. 


16. 


17. 


positions on the clear plate to guarantee an optical control of 
transfection efficiency (see Note 8). 


. Prepare the transfection mix. Dilute the Lipofectamine 2000 


150-fold in Opti-MEM supplemented with 1% GlutaMAX, 
vortex for 1-2 s, and incubate for 5 min. For each 96-well, we 
use 0.2 pl Lipofectamine 2000 that is diluted in 30 pl Opti- 
MEM. We suggest preparing a 7x master mix for all conditions 
to be tested. 


. Pipette the Lipofectamine /Opti-MEM mix onto the DNA, 


mix (vortex), and incubate for 20 min at room temperature. 


. Remove medium from each well entirely (be careful not to 


remove the cells) and transfer the DNA/transfection mix (30 
pl per well) onto the cells. 


After 1 h, add 60 pl medium for maintenance without 
antibiotics. 


Depending on whether a constitutive or stimulus-dependent 
protein-protein interaction assay is run, 


e proceed to step 12 for constitutive, or 


¢ proceed to step 13 for agonist and agonist/antagonist 
assays. 


¢ Common experimental paradigms are schematically dis- 
played in Fig. 4. 


Constitutive assay: Incubate for 12-24 h. Typically, we incu- 
bate constitutive assays for 20 h. Before lysing the cells, verify 
the transfection efficiency using the clear optical control plate. 
By this time, the transfected plasmids should express consider- 
ably, and transfection efficiency, as determined by GFP signals, 
should be at least 30-50% to obtain stable assay conditions, 
otherwise start optimizing transfection parameters (see Note 9). 


e Proceed to step 17. 
Agomst and agonist/antagonist assays: Incubate overnight. 


On the next day, verify transfection efficiency (c.f. step 12, see 
Note 9) and replace the medium with 80 yl assay medium to 
induce the starvation of cells. 


After 16—24 h, add appropriate stimuli in 80 pl assay medium, 
with the agonists in double concentration. If studying ago- 
nist/antagonist effects, add the antagonists 1 h before the ago- 
nists (see Note 10). 


Incubate for 6-24 h, depending on the assay setup. This step 
strongly depends on the individual interaction measured, the 
cell type used, or biological question addressed. For a first 
approach, we incubate for 18-20 h (see Note 11). 


Aspirate the medium and add 30 pl of 1x Passive Lysis Buffer 
to each well. Incubate the 96-well plate on a horizontal rock- 
ing platform (100-150 rpm) for 10 min at room temperature. 


Table 1 


18. 


19. 


20. 
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Lysed cells can be stored at —20 °C for up to 4 weeks without 
losing significant levels of signal intensity (see Note 12). 


Perform the Dual luciferase readout. Add 30 ul of Firefly sub- 
strate into each well (from commercial Dual Luciferase Kit 
prepared as indicated by the manufacturer), read the plate 
using a luciferase reader. Add 30 pl of Renilla substrate into 
each well, read the plate (see Note 13). Alternatively, self-made 
substrates may be used (see Note 14). 


Calculate mean, standard deviations and standard errors of the 
mean using an Excel spreadsheet. Generate a bar graph to 
visualize the data. For dose-response analyses, we recom- 
mend using the dose response curve package ‘dre’ in R (see 
Note 15). Alternatively, the GraphPad Prism program can be 
used, which allows standard visualizations of dose response 
curves. 


Advice for troubleshooting addressing critical steps can be 
found in Table 1. 


Troubleshooting advice 


Step Problem Solution 
4,10 Low signals in general (both Do not use antibiotics as they might interfere with the 
firefly and Renilla) transfection reagent 
12,16 Good transfection efficiency, Although the optical transfection control is fine as 
good Renilla signals, but indicated by for example EGFP expression, the candidate 
low firefly signals: fusion in question may be poorly expressed 
e ORF-NTEV e Verify protein expression using Western blotting 
¢ ORF-NTEV-tevS-GV e Try to swap candidate proteins for NTEV and CTEV 
e ORE-CTEV fusion fragments. If using the cytoplasmic assay, you may also 
protein may not be place the NTEV moiety to either end. Protein 
expressed properly expression, however, also depends on the target protein 
chosen 
16 Low firefly signals for Stimulation time may vary for a given receptor or any other 
inducible PPIs inducible PPI. Therefore, we suggest to run an online 
luciferase split TEV assay to obtain the best activation 
window. If no appropriate reader is available, monitor 
assay activity from 6 to 20 h in an hourly or two-hourly 
mode after the agonist has been added 
18 Low luminescence signals, Double-check the functionality of the luciferase buffers. 


in particular for firefly 
luciferase 


This is of particular importance for the firefly luciferase 
buffer containing ATP, which can decay quickly. To do 
sO, we prepare test lysates from cells that constitutively 
express both firefly and Renilla luciferases. These lysates 
can easily be stored at —80 °C for several months 
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4 Notes 


1. 


When opting for the hybrid NTEV-tevS-GV tag, we recom- 
mend to first perform control assays that only contain the can- 
didate-NTEV-tevS-GV to exclude high background readings. 
For example, activated Sterile20-like kinase MST1 (STK4) is 
cleaved [43] and produces high background readings, thus 
precluding the use of the hybrid tag. In a next step, the candi- 
date may be tested with a protein that is known not to bind to 
the candidate to validate assay robustness, and to establish 
experimental parameters for a reference negative control. 


. Cloning of candidate split TEV fusions. Clone the candidate 


ORFs including all split TEV assay elements into a regular 
expression vector, such as pcDNA3.1. The fusions are con- 
structed to contain a 10 amino acid flexible linker (GGGGS 
GGGGS) between the candidate protein and NTEV or CTEV. 
As expression plasmids, we regularly use pcDNA3-derived 
(Invitrogen) and pTag-derived (Stratagene) vectors. If 
required, N- and CTEV fusions can be stably integrated into 
the cells using the mammalian selection markers present on 
these plasmids (i.e., neomycin or zeocin resistance). We regu- 
larly use Gateway recombination cloning (Life Technologies) 
to quickly obtain candidate fusions. Gateway-compatible ORFs 
for more than 16,000 ORFs are available at Plasmid ID 
(http://plasmid.med.harvard.edu/PLASMID/Home.jsp). 
The TEV protease mutant $219P can be obtained from 
Addgene (plasmid no. 8830, pRK792). 


. When opting for fluorescence-based assays, use a UAS-EGFP 


plasmid (i.e., pJFRC7-20XUAS-IVS-mCD8::GFP, Addgene, 
plasmid no. 26220). In this case, normalization can be done 
using nuclear staining or by co-transfecting a constitutively 
expressing mCherry plasmid. In addition, a fluorescence plate 
reader is required for quantification. 


. For investigating membrane protein interactions, we start test- 


ing assays in PC12 Tet-Off cells (Clontech) and then proceed 
to HEK293 cells (ATCC). For membrane-associated and solu- 
ble proteins, we commonly start with HEK293 cells. Notably, 
U20OS cells proved to a valuable cell line to monitor GPCR 
activities. As we observed substantial differences in the activa- 
tion ratios for some GPCRS, like AVPR1 and DRD2, and 
RIKs, like ERBB family receptors, we suggest to assess the 
optimal cell line for each individual assay. 


. Positive (known interacting protein) and negative (known 


non-interacting protein) controls should always be included in 
protein-protein interaction assays based on protein fragment 
complementation [33]. As a starting guideline, we suggest 


10. 


11. 


12. 


13. 


Characterising Dynamic PPls Using Split TEV 233 


using FKBP1A-NTEV or FKBP1A-CTEYV as negative controls 
as FKBP1A represents a small protein with a defined number 
of interactions described [44]. FKBP1A was used in split TEV 
assays [40], and also by other split protein approaches. Further, 
we have successfully used the GCN4-coiled coil domain 
(GCN4cc) as interaction control [9, 41]. Another strategy 
may be applying a truncated form of an interacting partner 
that lacks the defined interaction domain(s). However, it is 
crucial to validate its correct use first. 


. We suggest to store DNA plasmid aliquots at —20 °C. Avoid 


unnecessary freeze-thaw cycles as this decreases DNA concen- 
tration and quality. 


. Seeding the cells can be done on the previous day or in the 


morning of the day the experiment is planned. When seeding 
on the same day, increase the amount of cells by roughly 50% 
and wait at least 3-4 h to let the cells attach to the surface of 
the plate. 


. It is also possible to prepare the DNA mix on the previous day. 


If done so, freeze the DNA as small volumes may evaporate at 
os OF 


. To optimize transfection conditions, start with adapting plas- 


mid amounts, i.e., increase/decrease DNA amounts, and/or 
change ratios of the candidate X vs. candidate Y vs. reporter 
plasmids. Further, we find changing cell densities and/or 
Lipofectamine 2000 concentrations helpful. 


Agonists can be added in a 10x or 20x concentration diluted 
in assay medium. Note that both agonists and antagonists are 
frequently diluted in DMSO, which must not exceed 1% of the 
total volume. 


For RTKs, we regularly incubate for 20 h to obtain stable 
inducible readings. For GPCRs, we noted, however, that 
shorter incubation periods of 6-8 h yield better readings. To 
determine the optimal time window for a given inducible PPI, 
we also suggest to run an online luciferase split TEV assay 
before investing into a dose response assay. 


When opting for a fluorescent reporter, such as UAS_EGEP, 
do not lyse the cells, and proceed with acquiring the fluores- 
cent signal using a fluorescence reader or an automated 
microscope. 


When acquiring luciferase signal intensities using the Mithras 
device from Berthold Technologies, we apply a time integra- 
tion protocol, which measures each well for 2 s. Substrates are 
added by automated injection across the entire plate, followed 
by an orbital shake for 10 s and an incubation time of 10 min, 
followed by reading the signals. This process is performed first 
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14. 


for the firefly luciferase substrates, and then for the Renilla 
luciferase substrates. 


If using the self-made substrates for the Mithras device from 
Berthold Technologies, a different protocol for measuring the 
Renilla luciferase signals will be applied, as this substrate has a 
decreased stability. The substrate is injected, followed by an 
orbital shake of 2 s, and then immediately measured for 2 s. 
This Process is repeated well by well. The detailed protocol for 
making both firefly (a) and Renilla (b) substrates is listed 
below. We regularly order special reagents from p.j.k. GmbH, 
Kleinblittersdorf, Germany; these include coelenterazine, 
D-luciferin, co-enzyme A, ATP, DTT (1,4 dithiothreitol). 
c(0.001, 0.01, 0.1, 1, 10, 100), 


(a) Substrate for firefly luciferase 


Chemical Final concentration 
Tricine 20 mM 
(MgCO;3)s*Mg(OH).*5H,O 1.07 mM 
(magnesium 
carbonate-hydroxide-pentahydrate) 
MgSO, * 7 H,O 2.67 mM 
EDTA 0.1 mM 
DIT 33.3 mM 
Coenzym A 270 uM 
p-Luciferin, free acid 470 uM 
ATP 530 pM 


To solve magnesium carbonate, adjust the pH value using 
HCl (37%, use approximately 1 ml for 1 1) until solution 
becomes clear. Then, adjust the pH value to 7.8 using 5 M 
NaOH (use approximately 7.5 ml for 1 1). 


(b) Substrate for Renilla luciferase 


Chemical Final concentration 
NaCl 11M 

Na,-EDTA 2.2 mM 

KPO, (pH 5,1) 0.22M 

BSA 0.44 mg/ml 

NaN; 1.3 mM 


Coelenterazine (solved in EtOH) 1.43 mM 


15, 


using 2 M KOH (use approximately 15 ml for 1 1). Adjust pH 
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KPO, (pH 5.1): 1 M KH,POg,, adjust pH value to 5.1 


value to 5.0. Add coelenterazin afterwards. 


For storage, make 50 ml aliquots and store at —20 °C in 


the dark. Aliquots can be stored up to 6 months. 


The package ‘dre’ is used in R to calculate EC50 and IC50 
values and to generate fitted dose response curves. The script 
uses the drm function and is amended to include error bars 
based on the standard error of the mean (sem). The assay data 


is saved as txt file using a tab delimited format. Luciferase activ- 


ity (activity) is calculated in %, concentration (conc) in pM, 


and 


sem in %. 


A sample assay data set is shown below: 


ac 
Ts 
8. 
10 
7. 
12 
9 
95s 
13 
26 
31 
61 
70 
10 
95 


tivity 
024253731 
789256841 
- 69690609 
650808458 
- 68326337 
120219216 
865127488 
.02510883 
-97566853 
.31743626 
-8740283 
~77386505 
0 

- 9036847 


conc 
.00003 
-0001 
-0003 
O01 
003 
-O1 
.03 

il 

3 


ef oS So 2 oF Oo 2 2 © 


WwW 


10 
30 
100 


sem 


0. 


969091058 


-875653638 
1.271332669 
-991970081 
.117476168 
1.537416736 
1.367412567 
2.670494192 
3.160663768 
2.324396584 
3. 
9 

5 

1 


084176905 


. 107073314 
- 429347986 
3.04036002 


The dre script including the addition of error bars reads: 


Loading package 
library ("dre") 


Loading data 


dre <- read.delim ("D:/.../DRC_1.txt”, header=T, 


dec=".") 


dre 


summary (DR.1) 


Returning data 


LL.4()) 


Fitting the model 


DR.1 <- drm(activity~conc, 


data = 


drc, 


ECE 
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Calculating effective dos 


ED) values 


first column: the estimate 


ED90 


D(DR.1, c(50, 80, 90)) 


and 


s ED50, ED80, 


second column: the estimated standard errors 


Dout 


ED(DR.1, 


c(50)) 


Labelling of x axis, 


compound name & unit 


DRxlab 


Plotting 
activity 


plot(DR.1, 


,ylab = "activity[%3]", 
xt 


= Wott 
100) 
bty 


yl 


y 


EGFld 
& 10 ng/ml 


ALES 
ut[1,1],4),""M",sep=" 


[ng/ml]" 
EGFld" 


graph with y axis from 0 to 100% 


or "Lapatinib 


[uM] 


FALSE 


broken 


c("all") 
xlab DRxlab, log 
c(0.001, 0.01, 0.1, 1, 10, 
c(0, 115), legend FALSE, 
paste ("EC50: ",round(EDo 
")) #adjust EC/IC50 


, type 


im 


sub 


and concentration 


for (i in 1 


Adding error bars as sem 


:dim(drc) [1]) { 


segments (drc[i,2],drc[i,1]-drce[i,3],drc[i,2], 
drce[i,1]+drc[i,3],col="black") 
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Part VI 


Optogenetic Switches 


Chapter 15 


Development of a Synthetic Switch to Control Protein 
Stability in Eukaryotic Cells with Light 


Christof Taxis 


Abstract 


In eukaryotic cells, virtually all regulatory processes are influenced by proteolysis. Thus, synthetic control 
of protein stability is a powerful approach to influence cellular behavior. To achieve this, selected target 
proteins are modified with a conditional degradation sequence (degron) that responds to a distinct signal. 
For development of a synthetic degron, an appropriate sensor domain is fused with a degron such that 
activity of the degron is under control of the sensor. This chapter describes the development of a light- 
activated, synthetic degron in the model organism Saccharomyces cerevisiae. This photosensitive degron 
module is composed of the light-oxygen—voltage (LOV) 2 photoreceptor domain of Arabidopsis thaliana 
phototropin 1 and a degron derived from murine ornithine decarboxylase (ODC). Excitation of the pho- 
toreceptor with blue light induces a conformational change that leads to exposure and activation of the 
degron. Subsequently, the protein is targeted for degradation by the proteasome. Here, the strategy for 
degron module development and optimization is described in detail together with experimental aspects, 
which were pivotal for successful implementation of light-controlled proteolysis. The engineering of the 
photosensitive degron (psd) module may well serve as a blueprint for future development of sophisticated 
synthetic switches. 


Key words Optogenetics, Protein degradation, Proteasome, Ubiquitin-independent degradation, 
Protein stability, Synthetic biology, LOV2 domain, Blue light, Degron 


1. Introduction 


Regulated proteolysis is involved in virtually each and every regu- 
latory process in eukaryotic cells. Continuously, protein levels are 
adjusted during passage through the cell cycle or as reaction on 
changing environmental conditions, which results in the degrada- 
tion of selected proteins by the ubiquitin-proteasome system. 
Ubiquitin is a small protein that serves among other things as a 
marker for proteolysis by the proteasome. In general, an enzy- 
matic cascade of ubiquitin-activating enzyme (E1), ubiquitin- 
conjugating enzyme (E2) and ubiquitin-protein ligase (E3) 
modifies a protein substrate with several moieties of ubiquitin in 
the form ofa linear chain, which targets the protein for destruction 
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by the proteasome [1, 2]. The substrates contain linear motifs that 
are recognized by a specialized E3, which ensures selectivity of the 
process. These motifs are called degradation-inducing sequence 
or shortly degron, if they are necessary and sufficient for recogni- 
tion by the degradation machinery. Thus, modification of a stable 
protein by a degron will lead to its destabilization. In rare cases, 
proteins are also directly recognized by the proteasome and 
degraded in a ubiquitin-independent way [3]. 

Due to the importance of proteolysis in eukaryotic cells, syn- 
thetic control of protein stability is a powerful tool to interfere 
with regulatory mechanisms and to influence cellular activities. To 
implement synthetic regulation, the activity of a degron is switched 
from an inactive to an active state by a distinct signal. Until now, 
signals like temperature, small molecules, nutrients, cell cycle stage 
or light have been used to regulate the stability of conditional 
degrons in eukaryotic cells [4-9]. To confer synthetic regulation 
on a selected target protein, one of the degrons is fused to the tar- 
get gene to create a cell line in which target protein abundance and 
activity is controlled synthetically. 

Development of such a degradation tool requires two protein 
domains or sequences: a sensor domain that converts the signal 
into an output, e.g., a conformational change and a degron that is 
controlled by this switch. For generation of a light-controlled 
degron, the light-oxygen—voltage (LOV) 2 photoreceptor domain 
of Arabidopsis thaliana phototropin 1 was selected as sensor due to 
precise knowledge about the molecular changes in the LOV2 
domain after light-exposure. The LOV2 domain binds the cofactor 
flavin-mononucleotide (FMN), which is excited by blue light. 
Subsequently, the side-chain of a cysteine residue of LOV2 forms a 
covalent bond with the C4a atom of FMN. This induces structural 
changes in the LOV2 core resulting in unfolding of the so-called 
Ja-helix at the carboxy terminus of the photoreceptor [10, 11]. 

The degradation sequence selected for the conditional degron 
was derived from the well-studied murine ornithine decarboxylase 
(ODC). The ODC degron sequences are located at the very 
carboxy-terminus of the enzyme. It is somewhat unusual, as ubiq- 
uitin is not necessary for proteasomal degradation of ODC, whereas 
the vast majority of proteasome substrates are degraded in an ubiq- 
uitin-dependent way [12]. The active ODC degron has two require- 
ments: a cysteine-alanine motif located 19 amino acids upstream of 
the C-terminus flanked by sequences without secondary structure 
comprising a length of 37 amino acids in total [13, 14]. 

To generate the light-activated degron, the A. thaliana LOV2 
domain was fused to a 23 amino acid long peptide of a synthetic 
variant of the ODC degron called cODC1 [9, 15]. Several variants 
of this construct were produced to assess the overall performance 
of the degron, which was finally named photosensitive degron 
(psd) module (Fig. la). The variants that were used in comparison 
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a psd module: AtLOV2 (M460 - P616) - ODC23 from cODC1 
MRKGIDL...N/DEAVRELP-MSCAQESITSLYKKAGSENLYFQ 


variations: 
- AtLOV2 (M460 - £614) - ODC36: GITHGMDELYKLPMSCAQESITSLYKKAGSENLYFQ 
- AtLOV2 variants with - cODC1°4°*: MSCACAQESITSLYKKAGSENLYFQ 
point mutations in core - cODC1°4°4°4: MSCACACAQESITSLYKKAGSENLYFQ 
domain and Ja-helix -d€gope (murine ODC): MSCAQESGMDRHPAACASARINV 


b 


IK? 
hv ‘4® Pe A 
NAS "1 
mitt REP-AILOV2-cODC1 km —> Qt LOV2 Sime GBS & Sui 
cODC1 IN 5. 
&, 
Inactive degron Active degron @elsley Proteasomal apse 
degradation 
c 


Yeast cells grown in darkness: Yeast cells grown exposed to blue-light: 
high red fluorescence intensity low red fluorescence intensity 


Fig. 1 Engineering of the photosensitive degron (psd) module. (a) Design of the psd module and its variations. 
The psd module is composed of A. thaliana LOV2 of phototropin 1 and a degron derived from murine ODC. As 
photoreceptor, the LOV2 domain (amino acids M460 to P616 indicated by blue color) was chosen; at the 
C-terminus it carries the so-called Ja-helix (orange color). The full degron sequences are given (green color). 
The control for maximum degron activity was a fusion of LOV2 with ODC36, the control for minimal activity was 
a LOV2 domain lacking any degron sequence. Degron sequences similar to ODC23 were fused to the LOV2 
domain during optimization of the psd module. Variations in the LOV2 sequence comprised point mutations 
that kept the LOV2 domain in the lit (LOV2'°%) or the dark (LOV'*) state and were used to characterize the 
behavior of the module with minimal and maximal photoreceptor activity. To optimize the construct, site- 
directed mutagenesis was performed and the fusion point between LOV2 and the degron sequence was var- 
ied. Finally, the whole construct was used in a random mutagenesis approach. (b) Mechanism of psd module 
activation by blue light. Measurement of psd module variant performance was done with the constructs fused 
to red fluorescence protein (RFP). The constitutive ADH7 promoter was used to express psd module variants. 
In vivo, the psd module is inactive in darkness and the protein is stable. Upon blue light excitation of the LOV2 
domain, the Ja-helix (indicated in yellow) is unfolded and the cODC1 degron is exposed and activated. This 
induces ubiquitin-independent degradation of the construct by the proteasome. (c) Scheme showing RFP-psd 
behavior in yeast. During growth in darkness, the RFP-psd fusion protein is highly abundant and the yeast cells 
show high levels of red fluorescence. Cells grown in the presence of blue light show low fluorescence levels 
due to depletion of RFP-psd 
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Table 1 


with the psd module were constructs with the full CODC1 sequence 
and without any degron sequence to assess the maximal and mini- 
mal degradation activity, respectively. Similarly, LOV2 mutants 
that abolish signaling and lock the photoreceptor in the dark state 
or the light state were used as well [9]. The optimization process 
included the testing of psd module variants with changes in the 
degron sequence, alterations of the fusion sequence between 
LOV2 and cODC1 sequence, as well as testing of variants obtained 
by site-directed mutagenesis and random mutagenesis [9, 16]. For 
all constructs, red fluorescent protein (RFP) was used as reporter 
sequence, which allowed facile quantification of psd module vari- 
ant performance (Fig. 1b, c). Examples for the diverse characteris- 
tics that were measured in psd module variants are given in Table 1. 

Several conclusions can be drawn from our approach of gener- 
ating a synthetic degron reactive to light, which might be valid for 
the generation of other synthetic constructs as well. For successful 
generation of a synthetic switch, the molecular mechanisms within 
the sensor and the effector domain should be known. It is essential 
to take every detail into account that influences the activity states 
of both domains. Several different fusions of both domains should 


Characteristics of selected psd module variants 


Half-life in Half-life in 30 pmol m~*s~' Dark—light ratio of 


Name darkness (min) blue light (min) fluorescence measurements 
Wild type psd module 123+21 20+1 1120.5 
V19I 132 + 50 28+4 n.d. 
K92R E132A E155G 102 + 41 12+0.4 AD) he Sey? 
K92R E132A E139N 66 + 10 9.8+04 lo 12* 
N148D E155G 
K121 M N128Y 44+8 8.5 + 0.3 14+ 1.4* 
K121 MN128Y G138A 92 + 28 131 22 + 4* 
G138A 147 + 52 22 n.d. 
G138A N148E 151 +53 75 +14 n.d. 
AL156 AP157 187 +45 By te) n.d. 
degopc 86+21 20+3 n.d. 
CACA 372i 20+3 n.d. 
CAGAGCA 66 + 13 16+1 n.d. 


Half-lives were measured by cycloheximide chase analysis and dark—light ratio by fluorescence intensity measurements. 
The asterisk (*) marks darklight ratios that are considered to be statistically different from the ratio of psd (P< 0.05). 
The data in Table 1 were published previously [16]. Numbering of amino acids refers to the LOV2 domain, which starts 
at M460 in the full length phototropin 


2 Materials 


2.1 Yeast Strains 


2.2 Cloning 
Procedures 


23 Yeast Media 
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be tested, as receptor and effector domain might influence each 
other depending on the exact composition of the fusion site. Good 
controls that mimic maximum and minimum activity for both the 
receptor and the effector domain are necessary to have a good 
measure for the performance range of the synthetic switch. Once 
the switch shows reasonable performance, further optimizations 
by specific changes or a directed evolution approach may be con- 
ducted. In the latter case, it makes sense to allow changes in both, 
the signal receptor and the effector domain simultaneously, as the 
best performing variant might require alterations in both parts. 
Finally, a quick assay that is easily quantifiable is crucial to obtain 
reliable data about the performance of the different variants. The 
following part of this chapter contains a detailed description of the 
engineering steps and the yeast techniques that have been used to 
obtain and improve the photosensitive degron module. 


For homologous recombination and analysis of the psd module, 
no specific yeast strain is required; every wild type lab strain that 
has the necessary auxotrophy markers can be used (e.g., ESM356-1 
[17], S288C background). 


1. High-Fidelity DNA amplification system (e.g., Phusion, KOD, 
or Herculase) for amplifying large DNA fragments including 
polymerase 10x reaction buffer, 10 mM dNTP stock solutions 
each and oligonucleotide primers. 


2. Taq DNA polymerase for error prone PCR including Taq 
polymerase, 10x reaction buffer, 10 mM MnCl, stock solution, 
separate 10 mM stock solutions for dATP, dGTP, dCTP, and 
dTTP in order to optimize relative ratios for error prone PCR 
conditions and oligonucleotide primers. 


3. QuickChange Site Directed Mutagenesis kit (Agilent). 


4. Dpnl restriction enzyme (for removing residual DNA template 
after PCR). 


Standard media were used for yeast growth [18]. 


1. Yeast complete medium (YPD): 1% (w/v) yeast extract, 2% 
(w/?) peptone, 2% (w/v) glucose. 


2. Low-fluorescence medium [16]: 5 g/l (NH4)2SO., 1 g/l 
KH2POg, 0.5 g/l MgSOu, 0.1 g/l NaCl, 0.1 g/l CasCl, 0.5 
mg/l H;BO,, 0.04 mg/l CuSO,, 0.1 mg/l KJ, 0.2 mg/l 
FeCl;, 0.4 mg/l MnSO,, 0.2 mg/l Na,MoOu,, 0.4 mg/l 
ZnSO,, 2 mg/l biotin, 0.4 mg/l] calcium pantothenate, 2 
mg/1 inositol, 0.4 mg/I niacin, 0.2 mg/1 4-aminobenzoic acid 
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2.4 Yeast 
Transformation 


2.5 Plasmid Rescue 
from Yeast 


2.6 Cell Lysis 
and Immunoblotting 


2.7 Illumination 
of Yeast with Blue 
Light 


(PABA), 0.4 mg/l! pyridoxine HCl, 0.4 mg/I thiamine, 2% 
(w/r) glucose, supplemented with appropriate amino acids 
and/or other nutrients. 


. Solid synthetic complete medium (SC): 0.15% (w/») Bacto 


yeast nitrogen base without amino acids and without ammoni- 
umsulfate, 0.5% (w/v) ammoniumsulfate, 0.2% (w/v) drop-out 
mix lacking the compound used for plasmid selection, 2% 
(w/r) glucose, 2% (w/v) agar. 


. SORB buffer: 100 mM lithium acetate, 10 mM Tris-HCl pH 


8.0, 1 mM EDTA-NaOH pH 8.0, 1 M sorbitol, adjusted with 
acetic acid to pH 8.0 and filter-sterilized. 


. Carrier DNA: 10 mg/ml herring sperm DNA, denatured at 


100 °C for 10 min and immediately cooled on ice. 


. Polyethylene glycol (PEG) buffer: 100 mM lithium acetate, 10 


mM Tris-HCl pH 8.0, 1 mM EDTA-NaOH pH 8.0, 40% 
(w/v) PEG 4000; filter-sterilized. 


. Breaking buffer: 2% (v/v) Triton X-100, 1% (w/v) sodium 


dodecylsulfate (SDS), 100 mM NaCl, 10 mM Tris-HCl pH 
8.0, 1 mM EDTA-NaOH pH 8.0. 


1. Alkaline lysis buffer: 1.85 M NaOH, 7.5% #-mercaptoethanol. 
. High urea buffer: 5% (w/v) SDS, 8 M urea, 200 mM Na,HPO,/ 


NaH,PO, pH 6.8, 0.1 mM EDTA, 0.01% (w/v) bromophenol 
blue. 


. Protein-precipitation buffer: 55% trichloroacetic acid (TCA) 


(w/) 


4, Cycloheximide stock solution: 20 mg/ml. 
. Sodium azide buffer: 100 pM NaN3. 


6. Antibodies recognizing tagRFP are available from evrogen 


(www.evrogen.com), antibodies against Tubl (loading con- 
trol) were obtained from Abcam (www.abcam.com). 


. Light emitting diode (LED) stripes or clusters for illumination 


of yeast cells grown on plate or in liquid medium. The LED 
setup should include a dimmer to achieve a photon flux of 30 
mol m7 s“! at the level of the yeast cells. Single wavelength 
LEDs for blue light illumination (output wavelength 465 nm) 
or RGB LEDs with an appropriate controller can be used. 
Homogeneous illumination is advisable, which might be easier 
to achieve with many LEDs of lower light intensity output 
then with few high-power LEDs. 


. An optometer (e.g., P2000, equipped with light detector 


D-9306-2, Gigahertz-Optik, Tiirkenfeld, Germany) to mea- 
sure the light-intensity used for illumination of yeast cells. 


3 Methods 


3.1 Cloning 
by Homologous 
Recombination 
in Yeast 


3.2 Design 

of Primers for Cloning 
by Homologous 
Recombination 


Controling Protein Stability with Light 247 


. Clear plastic cell culture flasks equipped with a ventilated cap 


for illumination of yeast cultures growing in liquid medium. 


4. A lightproof box to grow yeast cells in darkness. 


. A box with LEDs attached to the lid, spaced evenly to ensure 


uniform illumination. Covering the interior of the box with 
reflective material optimizes illumination. The box should be of 
sufficient size to host several Petri dishes or cell culture flasks. 


. Lightproof, heat shrinkable tubing adjusted to a diameter 


which is slightly bigger than a test tube to protect small yeast 
cultures from ambient light. To create the tubing, test tubes 
were covered with aluminum foil and placed in heat shrink- 
able tubing, which was subsequently adjusted in size with a 
heat gun. After cooling, the aluminum foil was removed with 
forceps. 


Generation of shuttle vectors for yeast and Escherichia coli by 
homologous recombination has been described previously [19]. 
The method can be adapted in many ways to generate gene fusions 
(Fig. 2). 


1. 


Design primers for amplification of the gene of interest. The 
primers have two parts: for the 3’-end, sequences that allow 
AtLOV2 amplification are chosen. At the 5’-end, sequences 
derived from the vector are added, which will be used for the 
homologous recombination step in yeast that generates the 
final vector. 


. Perform PCR to generate the DNA fragment with the LOV2 


photoreceptor domain (see Note 1). To generate photorecep- 
tor variants, a PCR using mutagenic conditions can be per- 
formed (see Note 2). 


. Linearize the target vector by restriction enzymes. The 


sequences for homologous recombination should flank the gap. 


. Cotransform the fragments into frozen-competent S. cerevisiae 


cells. 


. Rescue the plasmid from yeast into E. coli cells, isolate it and 


verify the construct by enzymatic digest and sequencing. 


Target gene-specific primers contain sequences that are homolo- 
gous to the target gene and sequences homologous to the vector 
sequence, which are added to the 5’ ends (Fig. 2). The resulting 
PCR product contains the target gene (e.g., LOV2 photoreceptor) 
flanked by sequences for homologous recombination. In the same 
way, the vector backbone can be generated with one or several 
oligonucleotide pairs (see Note 1). 
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Workflow for the Generation of a synthetic Construct to control Protein Stability by Light 


Step 1: design of up and down primers M8) HE vector-specific sequences up-primer J 
with 3'-sequences homologous (35-45 base pairs) 


to the AfLOV2 domain and 5'-sequences 


homologous to the vector sequence gene-specific sequences = down-primer [ii 


Q up-primer 
Step 2: PCR to generate the insert product AtLOV2 
using an AtLOV2 containing plasmid pane t 
down-primer 


vector opened with — [unrelated sequence] Vector opened with 


restriction enzyme \ V4 restriction enzyme 


Step 3: linearization of the vector 


by enzymatic digest ( . ”) 


iW \Z 
Step 4: in yeast ligation by Z\ / 
homologous recombination 
Step 5: plasmid rescue from ( y 
yeast into E. coli, verification O O90. O O. O 


and retransformation in yeast 


Step 6: functional characterization of the construct under different conditions 


Fig. 2 Workflow chart of psd module generation. Step 7: Primer design strategy for the cloning of the photore- 
ceptor LOV2 into a yeast vector by homologous recombination. The primers have two parts; the 3’-end con- 
tains the sequence necessary to amplify the LOV2 gene, the 5’-end consists of sequence, which is identical to 
the vector sequence and will be used for homologous recombination in yeast. Step 2: Generation of the PCR 
product for homologous recombination. For a directed evolution approach, this PCR can be performed with 
conditions that enhance the error rate of the polymerase, which results in PCR products containing a library of 
photoreceptor mutants. Step 3: Linearization of the target vector, the sequences for homologous recombina- 
tion should flank the gap in the plasmid. Step 4: Cotransformation of the linearized vector and the PCR prod- 
ucts results in generation of the plasmid by homologous recombination. In case a directed evolution approach 
is undertaken, an appropriate assay is performed at this stage to identify interesting clones. Step 5: Plasmid 
rescue from yeast into E. coli to isolate the plasmid. The identity of the plasmid can be tested by enzymatic 
digest and sequencing. Yeast is transformed with verified constructs for further tests. Step 6: Functional char- 
acterization of the novel construct; quantification of target protein levels by fluorescence measurements and 
immunoblotting as well as cycloheximide chase analysis at different illumination conditions. Characteristics of 
several psd module variants are shown in Table 1 


3.3 High-Fidelity 
and Error-Prone PCR 
to Generate DNA 

for Homologous 
Recombination 


3.4 Site-Directed 
Mutagenesis to Create 
Specific psd Module 
Variants 


3.5 Yeast 
Transformation 
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1. Standard PCR conditions can be used to generate the products 
used for homologous recombination. For cloning purposes, a 
kit containing a high-fidelity polymerase (e.g., Phusion, KOD, 
or Herculase) is advisable to reduce the introduction of errors 
during the reaction. 


2. Fora directed evolution approach, error-prone PCR followed 
by in yeast ligation generates a library of clones [16, 20]. 


3. PCR is performed with Taq polymerase under conditions that 
increase the error rate of the enzyme which is achieved by the 
presence of different concentrations of MnCl, (0, 0.62, and 
1.25 mM) and a twofold excess of dCTP and dTTP (500 pM 
each in the final PCR reaction). 


4. If necessary, the template vector can be destroyed after the 
PCR step by enzymatic digest with DpnI, which cleaves 
methylated DNA. Products obtained in this way are used 
without further manipulations for homologous recombina- 
tion in yeast. 


The QuickChange protocol can be followed to change single posi- 
tions in the psd module by site-directed mutagenesis, which may 
result in psd modules with distinct characteristics. Mutations 
should be selected based on previous studies examining the same 
photoreceptor domain or a homologous one. Furthermore, 
changes that have been found in different variants obtained by ran- 
dom mutagenesis can be combined in a single construct by this 
method. 


1. The oligonucleotides are designed as follows: the changed 
codon is flanked by 15 bases matching the vector sequence. 
Both forward and reverse primers are the reverse complement 
of each other. At each end, a cytosine or guanidine is preferred 
due to increased binding stability. 


2. A whole vector PCR is performed with the template plasmid 
using standard conditions. A low number of cycles is recom- 
mended (<20). 


3. The template plasmid is digested with DpmI that cleaves spe- 
cifically methylated DNA. 


4, The product of the PCR is transformed into E. coli by a stan- 
dard method [21]. 


The protocol for yeast transformation is based on the lithium ace- 
tate method [22] and has been described previously [23-26]. 


1. Yeast cells are inoculated from an overnight preculture (approx. 
1:50 dilution) and grown to an optical density (Agoo) of 0.8-1.0 
at 30 °C in 50 ml of YPD medium. 
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3.6 Plasmid Rescue 
from Yeast 


. Yeast cells are harvested by centrifugation (3 min, 500 x 4), 


washed once with sterile water (0.1-0.5 volumes) followed by 
a washing step with sterile SORB buffer (0.1—0.2 volumes). 


. Cells are suspended in 450 ppl SORB buffer and 50 pl of carrier 


DNA is added. Cells are divided into appropriate aliquots and 
placed at —80 °C (no shock freezing). 


. Usually, 50 pl of competent cells are used for the transforma- 


tion with a PCR product. Thawed competent cells are mixed 
with PCR product (5-15 pl of DNA for 50 pl of cells) and six 
volumes of PEG buffer are added. 


. Cells are incubated at room temperature for approximately 


30 min. 


. Cells are incubated at 42 °C for 5-20 min (15 min works well 


with most strains). 


. Cells are sedimented (3 min, 500 x g) and washed once with 


YPD. 


. The pellet is resuspended in YPD and spread onto an appropri- 


ate selection medium. Plates with yeast cells harboring a light- 
sensitive construct should be protected from ambient light 
during growth and storage. 


A modified standard method is used to transfer plasmids from yeast 
to E. coli for further usage [21]. 


1. An amount of yeast cells corresponding to the size of a match 


head is scraped off a plate and dissolved in 500 pl of breaking 
buffer. 


.200 wl of phenol-chloroform—-isoamyl alcohol mixture 


(25/24/1, buffered with TE, pH 7.5-8) and 0.3 g of glass 
beads (~200 ul) are added. 


. Cells are disrupted by vortexing (5 min, highest speed) and 


phases are separated in a microcentrifuge (10 min, 16,000 x y). 


. 400 pl of the aqueous layer are transferred to a new tube and 


1 ml of ethanol is added to this tube. 


. The tube is incubated at —20 °C for 10 min and then subjected 


to centrifugation (10 min, 16,000 x y). 


. The supernatant is removed and the pellet is washed with 70% 


ethanol. 


. The tube is centrifuged again (3 min, 16,000 x y), the super- 


natant removed and the pellet air-dried for 3 min. 


. The pellet is dissolved in 20 pl of desalted water. 


9. 1 wl of the solution is used for transformation of E. coli cells by 


electroporation. 


3.7 Detection of the 
Target Protein RFP 
by Immunoblotting 


3.8 Measurement 

of Protein Degradation 
by Cycloheximide 
Chase 


3.9 Quantification 
of Target Protein 
Levels 

by Fluorescence 
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The target protein can be detected by immunoblotting using anti- 
bodies directed against the tester protein tagRFP. For immunob- 
lotting, crude cell extracts can be prepared by alkaline lysis [27]. 


1. 1 ml of logarithmically growing cells (Ago9 = 1) is treated with 
150 pl of alkaline lysis buffer and kept for 10 min on ice. 


2. Protein precipitation is induced by addition of 150 pl 55% 
(w/?) trichloroacetic acid (TCA) followed by 10 min incuba- 
tion on ice. 


3. The samples are subjected to centrifugation (10 min, 16,000 x 
J) and the supernatant is removed. The pellet is dissolved in 60 
ul of high urea buffer by mixing the sample vigorously at 65 °C. 


4. The extracts are cleared from cell debris by centrifugation (10 
min, 16,000 x g), 10-20 pl of sample is loaded onto an SDS- 
PAGE gel. Standard procedures can be used for SDS-PAGE 
and blotting [28, 29]. 


Cycloheximide is a translational elongation inhibitor that can be 
used to stop protein biosynthesis in eukaryotic cells. Thus, the sta- 
bility ofa protein can be inferred from the decay rate after addition 
of cycloheximide to logarithmically growing yeast cells. Target 
protein levels can be detected by an appropriate assay, e.g., 
immunoblotting. 


1. Grow yeast cells in darkness until logarithmic growth phase is 
reached. 


2. Take first sample (¢ = 0 h) and subject to alkaline lysis. 


3. Add cycloheximide to a final concentration of 200 pg/ml to 
the yeast culture to stop protein synthesis. 


4. Expose cells to specific illumination conditions, e.g., darkness or 
a distinct amount of blue light for the rest of the experiment. 


5. Collect equal amounts of sample at several time points that 
match the protein decay rate (e.g., every 30 min) and subject 
to alkaline lysis. 


6. Perform immunoblotting to measure protein abundance over 
time. 


The target protein RFP can directly detected by fluorescence mea- 
surements with a fluorimeter. For the interpretation of the results, 
the maturation time of the used fluorescent protein has to be taken 
into account. 


1. 1 ml of logarithmically growing cells (Ago9 = 1) is treated with 
100 ul of sodium azide buffer. 


2. Samples are subjected to centrifugation (3 min, 500 x g) and 
1 ml of the supernatant is removed. The pellet is dissolved in 
the remaining supernatant. 
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3.10 Blue Light 
Illumination of Yeast 


4 Notes 


3. The samples are transferred in a lightproof, flat-bottom 


microtiter plate and transferred to a fluorimeter. 


4. The fluorescence intensity of each sample is measured accord- 


ing to the protocol given by the manufacturer of the 
fluorimeter. 


The photosensitive degron module is activated by blue light (465 
nm, 30 pmol m~ s“); cells are grown under continuous illumina- 
tion (see Note 3). The illumination regimen can be easily adapted 
for yeast cells growing on solid medium (see Note 4). 


1. Yeast cells are grown in low fluorescence medium supple- 


mented with 2% glucose in darkness or under continuous illu- 
mination until mid-log phase is reached (see Note 5). 


. Degradation of the psd module is initiated by exposure of the 


cells to blue light (465 nm, 30 pmol m~ s“!). The time frame 
until depletion of the target protein is achieved depends on the 
psd module variant, but it can be expected that 2—4 h are suf- 
ficient for short-lived psd module variants (see Note 3). 
Immunoblotting or fluorescence measurements can be used to 
quantify RFP-psd levels. Measuring the abundance of the con- 
struct in cells kept under restrictive conditions (blue light- 
exposed) and permissive conditions (darkness) gives a measure 
for the depletion efficiency and the overall performance of the 
psd module variant. Ongoing target protein synthesis will lead 
to minimal amounts of target protein even after prolonged 
exposure to blue light (see Note 6). The overall performance 
of a psd module variant can be assessed with mutants, in which 
the degron or the photoreceptor is inactivated (see Note 7). 


. Homologous recombination offers high flexibility for vector 


design. The illustration shows the most simplified case, in yeast 
ligation of a single gene or gene fragment into an existing vec- 
tor (Fig. 2). However, with the same approach, several frag- 
ments can be ligated together in one step. To do so, it is 
necessary to generate DNA fragments with flanking homolo- 
gous sequences. The same strategy as depicted for AtLOV2 
can be used to combine several genes or gene fragments, e.g., 
one-step-fusion of a photoreceptor with a degron sequence. 
Also the vector backbone can be produced by homologous 
recombination of PCR fragments [19]. 


. To generate photoreceptor variants, the PCR to amplify the 


AtLOV2 domain can be performed with a polymerase lacking 
proofreading activity like the Taq polymerase. Furthermore, 
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addition of manganese ions and unbalanced amounts of 
nucleotides will increase the error rate. Standard protocols can 
be found in the literature [16, 20, 21]. 


. The FMN cofactor of the LOV2 domain is excited by blue 
light (400-500 nm) with a maximum at 450 nm. Thus, the 
wavelength used for illumination can be varied without affect- 
ing the protein destabilization activity of the psd module. At a 
light-flux of 30 pmol m~ s (465 nm), strong activation of 
the psd module has been observed, whereas yeast cell growth 
is not affected [9, 16]. A much higher light flux is not recom- 
mended, as yeast cells are sensitive to high-intensity blue light 
[30]. To characterize the behavior of the psd module at differ- 
ent light conditions, the light flux can be varied between 0 and 
30 pmol m~ s“. It is advisable to measure the actual light-flux 
before the experiment with an optometer. The illumination 
strength influences the psd module degradation rate; reduced 
illumination will prolong the time that is necessary to reduce 
the target protein amounts in most cases [9, 16]. 


. For growth in the presence of blue light on solid medium, 
yeast cells bearing a plasmid can be grown on synthetic medium 
lacking the appropriate auxotrophy compound. An upright 
orientation of the Petri dishes is recommended to expose the 
cells directly to the light, which avoids unwanted shading. The 
same LED-equipped boxes that are used for growth in liquid 
culture can be used to grow yeast on solid medium. 


. Low fluorescence medium is recommended to grow cells 
exposed to defined amounts of light. The carbon source can 
be chosen freely. Other media can be used as well, as long as 
the blue light is not completely absorbed by the medium. If 
light absorption takes place, the illumination conditions have 
to be adjusted accordingly. The main difference between low 
fluorescence medium and synthetic complete medium is the 
lack of flavin derivatives in low fluorescent medium. Ifa yeast 
strain is used that does not produce FMN in appropriate 
amounts, it can be added to the medium or synthetic com- 
plete medium can be used. In both cases, the blue light 
absorption of the flavin derivative in the medium has to be 
taken into account and the illumination conditions have to be 
adjusted accordingly. 

Depending on the experiment, different illumination regi- 
mens are applied. For cycloheximide chase experiments, cells 
are kept in darkness during the initial growth phase and illumi- 
nation is started after the first sample has been collected. For 
protein level quantifications, cells are grown under constant 
conditions, either exposed to a certain light flux or kept in 
darkness. 
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6. The psd module constructs are expressed by a constitutive 


promoter, which leads to constant synthesis of the psd mod- 
ule. Under blue light illumination, a steady-state level is 
reached, which depends on production and degradation rates 
of the protein. Usage of a regulatable promoter may further 
decrease the depletion time and lower target protein abun- 
dance at restrictive conditions. 


7. A suitable control to assess the influence of a certain mutation 


on the degradation mechanism of the construct under consid- 
eration is a mutation that abolishes cCODC1 degron activity. 
For constructs with differences in light-regulated activation of 
the degron, a similar stability is expected after inactivation of 
the degron. If a mutation in the photoreceptor domain influ- 
ences the stability of the construct without increasing the 
degron presentation, the increased degradation rate persists 
after mutational degron inactivation. Such a mutation is less 
valuable, as the contribution of the mutation is not controlled 
by light and cannot be influenced by further modifications of 
the photoreceptor. Ideally, all modifications influence the deg- 
radation rate solely by impacting on degron presentation. In 
this case, the signaling state of the photoreceptor determines 
protein stability. 


8. Another set of controls are mutations that abolish light- 


dependent signaling of the photoreceptor. For the A. thaliana 
phototropin 1 LOV2 domain examples are I608E and C512A, 
which lock the photoreceptor in the light and the dark state, 
respectively. Please note that for mutations similar to C512A 
some residual light activation has been observed in homolo- 
gous LOV domains [31 ]. 
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Chapter 16 


Light-Regulated Protein Kinases Based 
on the CRY2-CIB1 System 


Wignand W.D. Muhlhauser, Maximilian Horner, Wilfried Weber, 
and Gerald Radziwill 


Abstract 


Optogenetic approaches enable the control of biological processes in a time- and space-resolved manner. 
These light-based methods are noninvasive and by using light as sole activator minimize side effects in 
contrast to chemical inducers. Here, we provide a protocol for the targeted control of the activity of protein 
kinases in mammalian cells based on the photoreceptor cryptochrome 2 (CRY2) of Arabidopsis thaliana 
and its interaction partner CIB1. Blue light (450 nm)-induced binding of CRY2 to CIB1 allows the 
recruitment of a chimeric cytosolic protein kinase AKT1 to the plasma membrane accompanied with 
stimulation of its kinase activity. This protocol comprises the transient and stable implementation of the 
light-regulated system into mammalian cells and its stimulation by blue light-emitting diodes (450 nm) 
irradiation as well as analysis of the light-activated AKT1. 


Key words Optogenetics, Signal transduction, Protein kinases, Membrane recruitment, AKT, CRY2 


1 Introduction 


Protein kinases play a key role in orchestrating the complex cellular 
signal transduction machinery. By phosphorylation they can influ- 
ence activity, localization, and/or the interactome of these signal- 
ing complexes. Thereby, protein kinases significantly influence cell 
fate decisions (e.g., proliferation vs. differentiation; survival vs. 
apoptosis). For this reason, it is not surprising that protein kinases 
themselves underlie a tight regulation. Reoccurring mechanisms 
comprise regulation by other kinases, by local recruitment to 
specific subcellular compartments or clustering (e.g., dimeriza- 
tion) of kinases [1]. 

Conventional approaches, which try to mimic activation 
mechanisms, often employ chemical inducers or mutants. Major 
drawbacks of those approaches lie in their poor or altogether miss- 
ing spatiotemporal resolution and, in certain cases, unwanted or 
unknown side effects in the case of chemical inducers. Therefore, 
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the use of constantly improving light-inducible dimerization/ 
oligomerization systems (i.e., optogenetic tools) becomes more 
and more popular. Using light as the sole activator allows unprec- 
edented spatiotemporal resolution with minimal side effects. This 
allows the intervention of signal transduction in a most sophisti- 
cated manner [2, 3]. 

One optogenetic tool that has been employed in the activation 
of various kinases is the blue light-responsive cryptochrome 2/ 
cryptochrome-interacting basic-helix-loop-helix 1 (CRY2/CIB1) 
system [4-6]. Cryptochromes are flavoprotein photoreceptors 
found in bacteria, higher plants, and animals. They were first iden- 
tified in Arabidopsis thaliana, where they regulate growth and 
developmental processes [7 ]. Cryptochromes contain a C-terminal 
cryptochrome extension and an N-terminal photolyase homology 
region (PHR) that noncovalently binds flavin adenine dinucleotide 
(FAD). Irradiation with blue light of wavelengths ranging from 
390 nm to 480 nm leads to FAD reduction, which converts cryp- 
tochrome into its biological active conformation [8]. The activated 
Arabidopsis thaliana CRY2 can interact with CIB1, which is 
involved in the mechanism of floral initiation. Besides the interac- 
tion with CIB1, photoexcited CRY2 can also form homomers. 
Light-activated CRY2 forms homo- and heteromers within seconds 
and dissociates in the dark within minutes in a repeatable manner. 
Optogenetic systems have been generated based on the light- 
induced oligomerization of the PHR region of CRY2 (aa 1—489) 
and the N-terminus of CIB1 (CIBN; aa 1—170). The characteristics 
of the CRY2-based optogenetic are especially suited for the control 
of signaling events in mammalian cells [3, 9] (Fig. 1). For instance, 
blue light-induced clustering of CRY2PHR fused to the protein 
kinase RAF as well as the recruitment of chimeric RAF to mem- 
brane-bound CIBN activates the mitogen-activated protein kinase 
cascade (MAPK) signaling pathway [4, 5, 10]. To gain control of 
the PI3K/AKT pathway, CRY2PHR either fused to the inter- 
SRC-homology 2 (iSH2) domain of the regulatory p85a@ subunit 
of PI3K or to the protein kinase AKT were recruited to a mem- 
brane-anchored CIBN to activate signaling upon blue light irra- 
diation [6, 11]. Due to its high relevance constant efforts to 
further improve this system are undertaken (e.g., enhanced oligo- 
merization) [12, 13]. 

In this chapter, we describe a detailed approach of how to use the 
CRY2PHR/CIBN optogenetic tool to control AKT] (optoAKT) 
to activate the PISK/AKT signaling pathway. A detailed description 
of the experimental setup, as well as a step-by-step instruction of the 
experimental procedure, will be given. By following the instructions 
provided, the experienced scientist will be able to generate, either 
transiently or stably, optoAKT-expressing cells. Upon irradiation 
with blue light AKT1, fused to the PHR domain of CRY2, is 
recruited toward a membrane-bound CIBN. The light-dependent 
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Fig. 1 Functional principle of a CRY2/CIBN-based optoKinase. Irradiation of the system with blue light (450 nm) 
results in a conformational change of CRY2 to its biological active form within seconds. (a) The photoexcited 
CRY2 fusion proteins form homooligomers which leads to clustering of the chimeric kinase. (b) In the presence 
of CIBN, photoexcited CRY2 additionally forms heterodimers with CIBN. This characteristic can be utilized for 
recruitment of the chimeric kinase toward specific subcellular compartments. Oligomers dissociate in the dark 
within minutes. CRY2: photolyase homology domain of CRY2, aa 1—489; C/BN: CIB1, aa 1—170 


recruitment toward the membrane leads to the activation of AKT1. 
The validation of kinase activation is shown by Western blot analysis 
using antibodies against AKT, AKT),730s, and AKT)s473. The used 
constructs further contain fluorescence proteins, which further allow 
the visualization of ight-dependent membrane recruitment via fluo- 
rescence microscopy. The described methods can be transferred to 
other protein kinases depending on oligomerization or recruitment 
to defined subcellular regions for activation. 


2 Materials 


2.1 Cell Culture 1. Culture medium: Dulbecco’s modified Eagle medium 
(DMEM) supplemented with 10% (v/v) fetal calf serum (FCS) 
and penicillin (100 U/ml)/streptomycin (100 pg/ml). Stored 
at 4 °C. 

2. Phosphate buffered saline (PBS): 10 mM Na,HPO,, 2 mM 
KH2PO,, 137 mM NaCl, and 2.7 mM KCl, pH 7.4. 

3. Trypsin-EDTA solution: 0.05% Trypsin, 0.02% EDTA in PBS 
without Ca** and Mg?*. Stored at 4 °C. 

4. CASY model TT cell counting device, 150 jim (Roche, cat. no. 
05651697001) with CASY cups (Roche, cat. no. 05651794001) 


and CASY ton buffer (Roche, cat. no. 05651808001). 
Alternatively, a standard hemocytometer can be used. 
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2.2 Transient 
Transfection 


2.3 Transduction 


2.4 Irradiation 
and Readout 


2.5 Western Blot 


Bi 


Cell culture-certified disposable plasticware: 10-cm Petri 
dishes, 6-well and 24-well plates, serological pipettes, and 15- 
and 50 ml conical centrifuge tubes. 


. Centrifuge with rotor which allows centrifugation of 15 ml 


conical centrifuge tubes. 


. CO, incubator at 37 °C and 5% CQO. 


. Opti- MEM (Thermo Scientific, cat. no. 31985-062). Store at 


4 °C. 


. Polyethylenimine (PEI) transfection solution: 1 mg/ml PEI in 


HO, adjusted to pH 7.0 with HCl. After sterile filtration (© 
0.22 pm) 500 pl aliquots are stored at —80 °C. Avoid repeated 
freeze-thaw cycles. 


. Silica-based anion-exchange DNA purification Kits. 


. Advanced DMEM (Thermo Scientific, cat. no. 12391-015). 
. Cholesterol: 5 mM diluted in PBS. 


3. Egg lecithin (Serva Electrophoresis GmbH): 5 mM diluted in 


Ethanol. 


. lx chemical defined lipid concentrate (Gibco, cat. no. 


11905031). 


. Hexadimethrine bromide (polybrene): Dilute in deionized 


water to a final concentration of 8 mg/ml. 


. Centrifuge that allows centrifugation of cell culture plates. 


. Syringe and 0.45 pm filter (low protein binding). 


. Light boxes: Build out of opaque PVC material. Panels of 


LEDs emitting light with a wavelength of 450 nm should be 
placed on the top of the box (Roithner LaserTechnik, LED450- 
series). The intensity of the LEDs has to be adjustable. Further, 
it is advantageous to include a mean of further programming 
the LEDs ON/OFF time to address temporal aspects of sig- 
naling. A ventilation of the light box should be included to 
ensure that the temperature as well as atmosphere in the box 
corresponds to the surroundings (see Note 1). 


. LED safe lights with wavelength longer than 500 nm. 


. 5x SDS-loading buffer: 10% (w/v) SDS, 0.3 M Tris-HCl (pH 


6.8), 50% (v/v) Glycerin, 12.5% (v/v) $-mercaptoethanol, 
0.05% (w/v) bromphenol blue in fully desalinated H,O. 


. ECL solutions. 


3. TENT cell lysis buffer (modified): 20 mM Tris-HCl (pH 8.0), 


1 mM EDTA, 100 mM NaCl, 0.5% (v/v) Triton X-100, 1 mM 
sodium orthovanadate, 10 mM _ sodium pyrophosphate, 


2.6 Fluorescence 
Microscopy 


2.7 Cell Lines 
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50 mM sodium fluoride, 10 mM &-glycerophosphate, 1x prote- 
ase inhibitors (Roche, cat. no. 04693132001), 0.1% SDS 
(w/v). Store at 4 °C. 


. Running buffer: 192 mM glycine, 25 mM Tris, 0.1% (w/v) 


SDS in fully desalinated H,O. 


. PVDF membrane. 


6. Transfer buffer: 125 mM glycine, 25 mM Tris, 10% (v/v) 


10. 


Bm ow WN 


methanol in fully desalinated H,O. 


. Tris-buffered saline, 0.025% (v/v) Tween-20 (TBS-T): 50 mM 


Tris-HCl (pH 7.5), 150 mM NaCl in distilled H,O. Adjust 
pH to 7.5 with HCl before adding 0.025% (v/v) Tween-20. 


. TBS-T supplemented with 5% BSA (v/v). 
. Primary antibodies (Cell Signaling Technology): AKT (pan) 


(C67E7) (cat. no. 4691), Phospho-AKT (Ser473) (cat. no. 
9271), Phospho-AKT (Thr308) (cat. no. 4056). 


Secondary antibodies (Santa Cruz Biotechnology): goat anti- 
rabbit IgG-HRP (cat. no. sc-2030). 


. 1,4-diazabicyclo[2,2,2 Joctane (DABCO). 
. Dulbecco’s phosphate buffered saline (DPBS). 
. Immersion oil. 


. Mowiol mounting solution: 2.4 g mowiol, 6 g glycerol in 


133 mM Tris-HCl, pH 8.5. Store at 4 °C. 


. Paraformaldehyde (PFA) cell fixation solution: 4% (w/v) PFA 


in PBS. Store at —20 °C. 


. 12 mm round cover slips. Autoclave and store in a sealed con- 


tainer prior to use. 


. Fluorescence microscope with filter for green and red fluores- 


cent proteins. 


. Microscope slides. 


. C2C12: DSMZ, Braunschweig, Germany, cat. no. ACC 565. 


2. HEK293T: DSMZ, Braunschweig, Germany, cat. no. ACC 635. 


. Platinum E: Cell Biolabs, INC., San Diego, USA, cat. no. 


RV-101. 


. MCF7: DSMZ, Braunschweig, Germany, cat. no. ACC 115. 


HEK293T, Platinum E, and MCE7 cells are cultured in DMEM 


supplemented with 10% (v/v) FCS and penicillin (100 U/ml)/ 
streptomycin (100 pg/ml). C2C12 cells are cultured in DMEM sup- 
plemented with 15% (v/v) FCS and 1% (v/v) penicillin/streptomycin. 
Starvation was performed with DMEM supplemented with penicillin 
(100 U/ml)/streptomycin (100 pg/ml) (see Note 2). 
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Table 1 
List of constructs used for the described methods in this protocol 


Vector Description Reference 


PA-AKT CRY2(PHR)-YFP-AKTI1 [6] 
Vector coding for CRY2(PHR)-YFP-AKT1 in a pMX backbone (Cell 
Biolabs, Inc.). Suitable for generation of ecotropic retroviral vectors 
(Fig. 3). 


pGR427 CRY2(PHR)-EGFP-HA-AKTI1 This work 
Vector coding for CRY2(PHR)-EGFP-AKT1 in pcDNA3.1 backbone 
(Thermo Fisher Scientific Inc). Suitable for transient transfection 
(Fig. 2a). 
pGR464 CIBN-mCherry-Caax This work 


Vector coding for CIBN-mCherry-CaaX in pcDNA3.1 backbone. 
Suitable for transient transfection (Fig. 2a). 


pwWM043 CRY2(PHR)-EGFP-HA-AKTIAPH (Fig. 2b) This work 
Vector coding for CRY2(PHR)-EGFP-HA-AKT1 py in pLVX 
backbone (Clontech Laboratories, Inc.). Suitable for the generation 
of lentiviral vectors. 


pwWM023 m/p-mCherry-CIBN This work 
Vector coding for m/p-mCherry-CIBN in pLVX backbone. Suitable 
for the generation of lentiviral vectors. (Fig. 2b) 


pwWM029 m/p-mCherry-CIBN This work 
Vector coding for m/p-mCherry-CIBN in pMX backbone. Suitable 
for the generation ecotropic retroviral vectors. (Fig. 3) 


pLTR-G Helper plasmid spanning the 3’ LTR of HIV-1 upstream of the [14] 

(Addgene vesicular stomatitis virus G protein (VSV-G) 
#17532) 

pCD/NL Helper plasmid coding for the HIV-1 Gag/pol, Tat, Rev. genes [15] 
BH*DDD 

(Addgene 
#17531) 

2.8 DNA Constructs See Table 1 for the list of constructs. For the transfection, it is 


important to use high-purity plasmid DNA, which can be obtained 
for example with Jetstar 2.0 Midiprep kit. 


3 Methods 


All cells are incubated in a cell culture incubator with a humidified 
5% CO, atmosphere at 37 °C. All work with living cells outside the 
incubator is conducted in a sterile cell culture bench at room tem- 
perature. Transduction using lentiviral vectors is performed under 
BSL2 precautionary measurements. 


3.1 Expression 
of optoAKT 
in Mammalian Cells 


3.1.1 Transient 
Transfection 


Optogenetic Control of Protein Kinases 263 


1. Day 1: Begin with a 10-cm plate with ~80% confluent 


HEK293T (transfection and lentiviral vector transduction) or 
Platinum E (retroviral vector transduction) cells. Pre-warm 
culture medium, PBS, and trypsin-EDTA solution in a 37 °C 
water bath. When preparing samples for fluorescence microscopy, 
place autoclaved coverslips into wells of 24-well plate. 


. Aspirate culture medium and wash once with PBS to remove 


residual culture medium before adding 1 ml of trypsin-EDTA 
solution. Place cells back into incubator for ~3 min. After incu- 
bation use a serological pipette to flush the plate with 5 ml 
culture medium and transfer the cell suspension into a 15 ml 
conical centrifuge tube. Centrifuge the conical centrifuge tube 
containing the cell suspension for 3 min at 300 x y. Afterward, 
discard the supernatant and resuspend the cell pellet in 10 ml 
culture medium. Dilute appropriate amount of cell suspension 
(dependent on CASY setup) in 10 ml CASY ton buffer within 
a CASY cup and determine cell number using CASY cell counter 
(see Note 3). 


. Seed HEK293T cells with a density of 0.75 x 10° cells/cm”. 


Either diluted in 2 ml culture medium for 6-well plates (for 
Western blot analysis) or in 0.5 ml culture medium for 24-well 
plates (for fluorescence microscopy). It is important that for 
each experimental condition at least one replicate on a separate 
plate is prepared, which is going to serve as dark control later. 


. Day 2: Cells should have reached ~60—-70% confluence for a 


good transfection efficiency. For the transfection of a 6-well 
use a total of 3 pg DNA and for 24-well plate a total of 0.75 pg 
DNA. Transfect pGR427 and pGR464 in a ratio of 1:1 (see 
Note 4). For sole transfection of pGR427 use an empty vector 
instead of pGR464 to guarantee comparable transfection con- 
ditions. Also include an empty vector control to ensure that 
irradiation itself is causing the observed effects. 


. Dilute the DNA in 100 pl or 50 pl opti- MEM for 6-well and 


24-well, respectively. Next add PEI solution (9 ul for 6 well; 
2.25 pl for 24 well) and vigorously vortex PEI/-opti-MEM/- 
DNA solution for 5 s (see Note 5). Let the solution incubate 
for 15 min at room temperature. Prior to applying the solution 
pipette it up and down a couple of times to ensure homogene- 
ity then add it dropwise onto cells. Afterward, gently rock the 
plate to evenly distribute the PEI/-optit MEM/-DNA mixture 
and put the cells back into the incubator (see Note 6). 


. After 5 h replace the transfection medium with fresh culture 


medium and return the cells back into the incubator. After an 
incubation period of ~24 h proceed with the experiment (con- 
tinue with Subheading 3.2). 
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3.1.2 Stable 
Transduction 

of Mammalian Cells 
(See Note 7) 


1. Day I: For lentiviral vector production complement 10 ml of 


Advanced DMEM with 2% (v/v) FCS and penicillin (100 U/ 
ml)/streptomycin (100 pg/ml). Further add 20 ul cholesterol, 
20 pl egg lecithin, and 1x chemical defined lipid concentrate. 
Dilute HEK293T cells in 10 ml with a density of 0.75 x 10° 
cells/cm? and seed them in a 10-cm plate. 

For retroviral vector production dilute Platinum E cells 
with a density of 0.75 x 10 cells/cm? in 10 ml culture medium 
and seed them in a 10-cm plate (see Note 8). 


. Day 2: Cells should have reached ~60—70% confluence before 


continuing. For lentiviral vector production prepare PEI trans- 
fection mix by adding 5 pg pLTR-G, 5 pg pCD/NLBH*DDD, 
10 pg pWM043 (or pWMO023), and 1 mlopti-MEM. Afterward, 
add 60 pl PEI and vigorously vortex the solution for 5 s. For 
retroviral vector production dilute 8 pg PA-AKT (or pWM029) 
in 1 ml opti- MEM. Then add 24 il PEI and vortex vigorously. 
In both cases, incubate the mixture for 15 min at room 
temperature before adding it dropwise on the cells. Gently 
rock the plate to ensure an even distribution of transfection 
mix. Place the cells back into the incubator. 


. After 5 h replace the transfection medium with fresh Advanced 


DMEM including the above-listed additives and return the 
cells back into the incubator. At this time point the medium 
may already contain viral particles. BSL2 precautionary mea- 
sures have to be undertaken when working with lentiviral 
vectors. 


. Day 3: As target cells seed MCF7 cells, for lentiviral vector 


transduction, or C2C12 cells, for retroviral vector transduc- 
tion, in 6-well plate with a density of 10° and 0.3 x 10° cells/ 
cm’, respectively, according to the procedure described in 
Subheading 3.1. 


. Day 4. After 48 h incubation the transfected HEK293T/ 


Platinum E cells have produced a sufficient amount of viral 
particles. Collect the supernatant, which contains either the 
retro- or lentiviral particles, using a serological pipette and filter 
it through a 0.45 pm sterile filter into a 50 ml conical centri- 
fuge tube (see Note 9). Add polybrene solution to a final con- 
centration of 8 g/ml to the viral particle-suspension. 


. Replace the medium of the target cells with a total of 2 ml viral 


particle-suspension. For a double transduction use 1 ml of each 
viral particle-suspension (e.g., 1 ml of pWM043 and pWM023 
harboring vectors). Next place cells back into the incubator for 
15 min. After incubation centrifuge plates for 30 min at 1100 xg 
(see Note 10). Subsequently to centrifugation discard viral 
particle containing medium and add fresh culture medium 
before returning cells into the incubator again. 


3.2 Light Experiment 
and Readout 


3.2.1 Western Blot 
Analysis 


1. 


1. 
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. Day 5: The transduced cells should have reached a confluence 


of ~80% before continuing. Switch off ambient light and turn 
on the safe lights. Aspirate the medium and wash the cells once 
with PBS before adding 150 pl trypsin-EDTA solution. Place 
cells back into incubator for ~3 min or until they detach. 


. Wash cells off the plate with 2 ml culture medium, transfer 


them to a new 10-cm plate, and add culture medium up to a 
total of 10 ml (see Note 11). Keep cells in incubator until they 
have reached ~80% density. 


. The cells now stably express the transduced constructs (see 


Note 12). Seed the cells (10° and 0.3 x 10° cells/cm? for 
MCEF7 and C2C12 cells, respectively) in either 6-well plates 
(for Western blot analysis) or on coverslips in 24-well plates 
(for fluorescence microscopy). Again, include a replicate on a 
separate plate for each condition as dark control. 


Day 1: Start with optoAKT expressing cells on 6-well or 
24-well plate transfected/seeded the day before (see 
Subheadings 3.1.1 and 3.1.2, respectively), which should have 
a confluence of ~80% in the evening. Replace culture medium 
with starvation medium (see Note 13) and place plates back 
into incubator. It is important that the cells are not exposed to 
any source of irradiation (390-480 nm) that would inadver- 
tently excite the optogenetic system during starvation. 


. Day 2: In the morning (~12-15 h post starvation) cells are 


ready for the experiment. The exact conditions of irradiation 
vary depending on the desired readout. 


For analysis of optoAKT by Western blotting set the intensity 
of the LEDs in the light box to 150 pmol/(m? s) and irradiate 
the cells 15 times for 5 s/min or constantly with a lower inten- 
sity (see Note 14). Intensities higher than 50 pmol/(m? s) may 
induce photo-cytotoxic effects when applied for extended time 
periods. Before further processing the cells ensure that ambi- 
ent light is turned off and safe light is turned on (see Note 15) 
and that there are no additional sources of light which acciden- 
tally could activate the optogenetic system. 


. Aspirate medium and wash cells once with PBS before adding 


250 ul ice-cold modified TENT cell lysis buffer. Gently shake 
the plates to evenly distribute the lysis buffer. Incubate the 
plates for 10 min on ice. 


. Use a cell scraper to detach the cells and transfer the cell lysate 


into a precooled 1.5 ml Eppendorf tube. Sonicate the lysate for 
15 min (30 s/min pulses) at 4 °C to fragment DNA and lower 
viscosity. In the next step, centrifuge lysates for 15 min at 
10,000 x g and 4 °C to pellet the insoluble components. 
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3.2.2 Fluorescence 
Microscopy 


Transfer 200 pil of supernatant into a new Eppendorf tube and 
add 50 pl 5x SDS-loading buffer (see Note 16), next boil the 
sample for 5 min at 95 °C (see Note 17). The lysates can be 
stored at —20 °C when not directly used. 


. For SDS-PAGE, load protein ladder and 10 ul of each sample 


onto a 9% SDS-gel with 15 pockets (8.3 x 6.4 cm). Conduct 
gel electrophoresis in running buffer at 90 V for approxi- 
mately 2 h. 


. For the semidry Western blot using a PVDF membrane, the 


membrane has to be activated in the first step. Therefore, incu- 
bate the membrane 1 min in methanol and then incubate for 
another minute in transfer buffer. At the same time, incubate 
two Whatman paper per gel in transfer buffer. Break open the 
SDS-gel chamber and assemble a “sandwich” consisting out of 
Whatman paper -- PVDF membrane -- SDS gel -- Whatman 
paper (anode to cathode) in a semidry blotter. Perform Western 
blot for 1 h at 0.35 A/SDS-Gel. 


. Afterward block the membrane in TBS-T supplemented with 


5% BSA (v/v) for 1 h. Dilute primary antibodies 1:1000 in 
TBS-T supplemented with 5% BSA (v/v) and add them to the 
membrane. Incubate overnight at 4 °C. 


. Day 3: Wash the membrane thrice with TBS-T for 10 min each 


time. Dilute secondary antibody coupled to horse radish per- 
oxidase 1:10,000 in TBS-T supplemented with 5% BSA (v/v) 
and incubate membrane for | h at room temperature. Again 
wash the membrane thrice with TBS-T for 10 min each time. 


. Apply ECL I/ECL II mixture (1:1) onto the membrane incu- 


bate for some seconds and detect chemiluminescence with 
luminescent image analyzer (Fig. 2). 


. Day I: Place one of the 24-well plates you transfected/seeded 


(see Subheadings 3.1.1 and 3.1.2) the day before into a light 
box while the control plate remains in the dark. Afterward irradi- 
ate cells with 1.5 pmol/(m? s) for 5 min. 


. Switch on safe lights before taking cells out of the incubator to 


prevent uncontrolled activation of the optoKinase system. 
Subsequently, wash them once with DPBS before adding 
200 pl 4% paraformaldehyde cell fixation solution per well 
(see Note 18). 


. Let the cells incubate for 10 min on ice and subsequently for 


additional 10 min at room temperature. Afterward, the sam- 
ples do not have to be handled under safe light anymore. 


. Apply 7 pl mowiol (with DABCO) onto a microscope slide, try 


to avoid generating air bubbles. 


. Use pincers to pick up cover slip and dip it a couple of times 


into water to remove excess of paraformaldehyde. Use a cos- 
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Fig. 2 Western blot analysis of cells expressing optoAKT. (a) HEK293T cells were transiently co-transfected with 
CRY2-EGFP-AKT1 (pGR427) and either membrane anchored CIBN-mCherry-CaaX (pGR464) (+) or junk DNA (-). 
After overnight serum-starvation those cells were irradiated with blue light (450 nm, 40 ymol/(m? s)) for 15 min (+) 
or kept in the dark (—). Whole cell lysates were prepared and subjected to Western blot analysis using the designated 
antibodies. (b) MCF7 cells co-transduced with CRY2-EGFP-HA-AKT1 jy (PWM043) and membrane anchored m/p- 
mCherry-CIBN (pWM023) were serum-starved overnight. Afterward, those cells were either irradiated with blue light 
(450 nm) with an intensity of 150 j:mol/(m? s) for 5 s/min 15 times (Pulse), with a constant intensity of 5 :mol/(m? s) 
for 15 min (Const) or kept in the dark (—). Whole cell lysates were prepared and equal amounts of lysate were sub- 
jected to Western blot analysis using the designated antibodies. endog. AKT endogenous AKT 


m/p-mCherry-CIBN CRY2-YFP-AKT1 merge 


C2Cc12 


Fig. 3 Fluorescence microscopic analysis of light-induced plasma membrane recruitment. Transduced cells, 
co-expressing photoactivatable CRY2-YFP-AKT1 [6] and membrane anchored m/p-mCherry-CIBN (pWM029), 
were seeded on cover slips and serum-starved overnight. Serum-starved cells were then either irradiated with 
blue light (450 nm, 1.5 pmol/(m? s)) for 5 min or kept in the dark. Afterward, cells fixated with paraformaldehyde 
were analyzed by fluorescence microscopy. Scale bar: 10 pm 


metic tissue to absorb the water and place the coverslip with 
the cells first on the drop of mowiol. Let the slides dry over- 
night in the dark to prevent bleaching. 


6. Day 2: Use nail polish to seal the samples before looking at 
them under the fluorescence microscope (Fig. 3). 
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4 Notes 


1. For a detailed description of the light boxes, see refs. 16, 17. 


10. 


ll. 


12. 


. It is also possible to use other cell lines. However, be aware 


that signal transduction can differ between cell lines and that 
the procedure described here might not be directly transfer- 
able to other cell lines. 


. When no CASY cell counter is available, a hemocytometer in 


combination with a bright field microscope can be used to deter- 
mine the cell number. If there is also no hemocytometer available, 
the amount of cells to be seeded can be vaguely estimated. 
One dense 10-cm plate contains approximately 1.5 x 107 cells. 


. When rocking the plate try to form a lying eight or perform a 


north-south followed by an east-west movement. Avoid circu- 
lar movement, since this is causing an accumulation of the cells 
in the middle of the plate. Once placed in the incubator do not 
move the plate for at least a couple of hours to ensure attach- 
ment of the cells. 


. To optimize the optoSystem the amount and/or the ratio of 


CRY2 and CIBN constructs can be adjusted. 


. Perform same movement as for seeding to prevent local accu- 


mulation of transfection mix. 


. The generation of a stable cell line takes time to begin with but 


is going to save time later. Furthermore, the overall expression 
level of the recombinant proteins can be further refined by cell 
sorting. 


. Instead of using a packaging cell line (i.e., Platinum E) the 


generation of retroviral vectors can be conducted by transfecting 
HEK293T cells with the appropriate helper plasmids. 


. When filtrating viral particles use a filter with low protein- 


binding properties. PVDF and nitrocellulose filter will bind 
viral particles and strongly reduce transduction efficiency. 


Wrap plates carefully with parafilm to prevent accidental splash- 
ing of viral particles and to delay equilibration of CO, with 
surrounding. 


In case trypsin-sensitive cells are used it may be advisable to 
centrifuge the cells and discard the supernatant to remove all 
trypsin; alternatively, an enzyme-free detachment solution can 
be used. 


It is possible (and a good idea) to expand the stable cell line 
generated at this point or after a first experiment, ensuring 
proper functionality of the optoSystem. This allows skipping 
future work under BSL2 conditions and also provides a backup 
of cells. 


13. 


14. 


15. 


16. 


17. 


18. 
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The residual amount of culture medium after aspiration gains 
increasing significance with decreasing surface area. Therefore, 
washing the wells with PBS in case of small well format is 
recommended (i.e., 48- /96-well format). 


Light intensities of 5 pmol/(m* s) or lower are strongly recom- 
mended, since they sufficiently trigger the optogenetic System, 
while showing negligible phototoxicity. 


Illuminate the cell culture lab by safelight LEDs (1.e., wave- 
length longer than 500 nm). Measure the wavelength of LEDs 
to ensure that they will not excite the blue light-responsive 
CRY2/CIBN system. 


After the centrifugation the almost clear pellet may be hard to 
detect. Make sure not to transfer it, since it will lower the qual- 
ity of the Western blot. 

When boiling samples use Eppendorf Safe-Lock tubes. This 
will prevent accidental opening of the tubes during boiling, 
which leads to uneven protein amounts between samples. 

Do not process too many samples at once. In case of the 
CRY2/CIBN system, the light-induced interaction of the 
fusion proteins is limited when working under safe light. 


This work was supported by the Excellence Initiative of the German 
Federal and State Governments (EXC-294, BIOSS — Centre for 
Biological Signaling). We thank J. Schmidt, D. Schiachtele, and 
J. MefSmer (University of Freiburg) for designing and constructing 
the illumination boxes. 
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Chapter 17 


Yeast-Based Screening System for the Selection 
of Functional Light-Driven K* Channels 


Cristian Cosentino, Laura Alberio, Gerhard Thiel, and Anna Moroni 


Abstract 


Ion channels control the electrical properties of cells by opening and closing (gating) in response to a wide 
palette of environmental and physiological stimuli. Endowing ion channels with the possibility to be gated 
by remotely applied stimuli, such as light, provides a tool for in vivo control of cellular functions in behaving 
animals. We have engineered a synthetic light-gated potassium (K*) channel by connecting an exogenous 
plant photoreceptor LOV2 domain to the K* channel pore Kev. Here, we describe the experimental strat- 
egy that we have used to evolve the properties of the channel toward full control of light on pore gating. 
Our method combines rational and random mutagenesis of the channel followed by a yeast-based screen- 
ing system for light-activated K* conductance. 


Key words Functional complementation, Protein evolution, Rational and random mutagenesis, 
Light, Screening, S. cerevisiae, Optogenetics, Ion channels, Potassium (K*), Gating 


1. Introduction 


Potassium (K*) channels are membrane proteins ubiquitous in liv- 
ing organisms. They drive the flux of K* ions across the ion- 
impermeant lipid membrane. In higher organisms, they play a 
fundamental role in controlling cell excitability and intercellular 
communication, particularly between neurons. Potassium channels 
are tetrameric proteins in which each monomer contributes to the 
formation of a central ion conductive pore surrounded by regula- 
tory or sensor domains. All K* channels operate according to a 
common principle: upon perception of the stimuli, the peripheral 
sensors convey the information to the pore that opens or closes in 
response (gating). Given their common modular architecture, K* 
channels tolerate swapping of pore and sensor modules between 
members of different families without losing functionality [1, 2]. 
More surprising is the evidence that exogenous sensors found in 
proteins unrelated to the K* channel superfamily can be grafted on 
the pore to control gating. In the past, we have engineered a 
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functional voltage-gated K* channel by connecting the voltage sensor 
of CiVSP, a phosphatase found in the ascidian Cizona intestinalis, to 
the pore of a Kcv, a K* channel found in the algae virus PBCV-1 
[3]. An exciting perspective is the expansion of signals perceived by 
K* channels beyond the realm of those already sensed by native 
channels. Regulation of K* channels by infrared light, magnetic 
fields or ultrasounds that freely penetrate mammalian tissues, will 
offer the unprecedented possibility to control cellular functions in 
freely moving and behaving organisms. 

We recently succeeded in engineering BLINK1, a blue light 
(475 nm)-gated K* channel that can be used to inhibit the electri- 
cal activity of neurons in optogenetics [4]. BLINK was engi- 
neered by connecting the light sensor module LOV2 of Avena 
sativa phototropin [5] to the N-terminus of Kev (Fig. la). Crucial 
for the project was the development of a yeast-based screening sys- 
tem that allowed fast screening for functional K* channels. We had 
previously shown that Kcv expression rescues growth of S. cerevi- 
siae Atrk1 Atrk2 mutants in low external K* [6, 7]. In this project, 
we were further screening for the ability of the engineered chan- 
nels to promote differential growth in light and dark. A prototype 
channel retrieved from the first round of screening was further 
improved through a random mutagenesis approach that generated 
a library of mutants. In a second round of screening, about 30,000 
clones from the library were transformed in yeast and the colonies 


L K LK 
[| Ceocs*—oe >. Imam 
es OO ees 


Fig. 1 Schematic representation of the engineering principles of the light- 
activated K* channel and of the yeast-based screening assay used to identify 
functional light-driven K+ channels. (a) The prototype light-gated K* channel (LK) 
was engineered by fusing the LOV2-Ja domain (L) to the viral Kt channel Kcv (A). 
Kand LK are inserted in the lipid membrane while L is cytosolic. Nand C indicate 
the first and the last aminoacid of each construct. (b) Yeast complementation 
assay for the detection of functional light-regulated K* channels: the mutant 
strain SGY1528 does not grow in low external K* concentration (4 mM) when it 
is transformed with L, neither in the dark nor in light. Expression of the functional 
K* channel Kcv (A) restores yeast growth, independently of light and dark growth 
conditions. Expression of LK, promotes differential yeast growth in light and dark, 
indicating light regulation of channel gating 


2 Materials 


2.1 Coding 
Sequences 

for the LOV2 Domain 
and the Kev Channel 


2.2 Expression 
Vector 


2.3 Cloning 
and Expression 
Organisms 


2.4 E.coli 
Growth Media 


2.5 Yeast 
Growth Media 
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tested for differential yeast growth in light/dark. Channel variants 
showing improved complementation ability, compared to the 
parental channel, were selected for further electrophysiological 
characterization of the channel currents. 


1. LOV2 domain: aminoacids 404-546 of Avena sativa 
Phototropin 1 (NPH1-1) (GenBank: AAC05083.1). 


2. Kev: aminoacids 2-94 of Paramecium bursaria Chlorella virus 1 
potassium ion channel protein (PBCV-1-Kcev) (NP_048599.1). 


1. The pYES2-Met25 expression vector [6] was used for protein 
expression in Saccharomyces cerevisiae. 


1. Escherichia coli (DH5a) for cloning procedures and plasmid 
DNA amplification. 


2. Saccharomyces cerevisiae Atrkl1 Atrk2 mutant (SGY1528) for 
functional complementation by light-driven K* ion channels. 


1. LB medium: add 10 g Tryptone, 5 g Yeast Extract, 10 g NaCl 
to 800 mL ultrapure water. Mix and adjust pH to 7.0 with 1 M 
NaOH. Make up to 1 L with water. Autoclave and store at 
RT. Before use, add selective antibiotics. For solid LB medium 
proceed as previous step and add 15 g/L agar. After steriliza- 
tion allow the medium to cool down, add selective antibiotics, 
and then pour medium into petri dishes. Seal solid plates with 
parafilm and store in the dark at +4 °C (see Note 1). 


2. Ampicillin: 50 mg/mL stock solution. Weigh 500 mg 
Ampicillin sodium salt and dissolve in 10 mL ultrapure water. 
Make aliquots and store in the dark at —20 °C (see Note 2). 


3. Gentamycin: 20 mg/mL stock solution. Weigh 200 mg 
Gentamycin sulfate; prepare a 10 mL solution and aliquot as in 
the previous step (see Note 2). 


1. Complete Yeast Medium supplemented with 100 mM KCl 
(YPDA + 100 KCl): weigh 20 g Peptone, 20 g D-Glucose, 
10 g Yeast Extract, 50 mg Adenine hemisulfate, 7.4 g KCI and 
add to 800 mL ultrapure water. Mix and adjust pH to 5.6 with 
1 M HCI; make up to 1 L with water and autoclave. To obtain 
solid medium, add 20 g Agar (Plant cell culture tested, Sigma- 
Aldrich) before autoclaving. Pour agar-containing medium 
into petri dishes while still molten and seal with parafilm when 
cooled. Store liquid medium at RT and plates at +4 °C. 


2. Minimal SD medium without Uracil and supplemented with 
Adenine and 100 mM KCI (SD-U + 100 KCl): Weigh 26.7 g 
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2.6 Mutation 
Strategies 


2.6.1 Enzymes 
and Reagents 


on ek WwW NY 


Minimal SD base (Clontech), 765 mg CSM dropout Uracil 
(CSM- Ura; Formedium), 100 mg Adenine hemisulfate, 7.4 g 
KCl and dissolve in water as described in the previous step. 
Mix and adjust pH to 5.5 with 1 M KOH. Make up to 1 L 
with water. To obtain solid medium, add 20 g Agar (Plant cell 
culture tested, Sigma-Aldrich). Autoclave and store liquid 
medium at RT, pour agar plates as previously described, and 
store at +4 °C. 


. Selective medium without Uracil and Methionine, supple- 


mented with Adenine and KCl (SEL-U-M + KCl): weigh 
725 mg L-Arginine, 100 mg Adenine hemisulfate, 1.5 g CSM 
dropout Uracil and Methionine (CSM-Ura-Met; Formedium), 
10 g D-Glucose and add to 800 mL of water. Add 2 mL of 
500 mM MgSO, and 200 pL of 500 mM CaCl,. Supplement 
with KCl at the desired final concentration diluting a 2 M stock 
solution (see Note 3). Mix and adjust pH to 6.0 with Phosphoric 
acid. Make up to 1 L with water. Autoclave and store liquid 
medium at RT. Before use, add 1 pL Vitamin solution and 
1 pL Trace Elements per each mL of media. For solid medium, 
add 15 g Low Potassium Agar (Sigma-Aldrich #P9338). After 
sterilization, allow the medium to cool down and add Trace 
Elements and Vitamins as previously described; pour medium 
into petri dishes. Seal solid plates and store at +4 °C in the dark 
(see Note 4). 


. Vitamin solution: weigh and add in a 50 mL tube 10 mg 


Biotin, 20 mg D-Panthotene acid calcium salt, 20 mg Nicotinic 
acid, 20 mg Pyridoxine HCl, 20 mg Thiamine, and 10 mg 
Myo-inositol. Add ultrapure grade water up to 50 mL, mix 
and filter the solution through 0.2 pm syringe filter. Store at 
+4 °C, in the dark (see Note 5). 


. Trace elements: weigh 25 mg Boric acid, 2 mg CuSO4, 5 mg 


KI, 25 mg FeCl3, 20 mg MnSO4, 45 mg Molybdic acid, 
20 mg ZnSO,, 500 pL HCl 37% and dissolve in 50 mL of 
water. Sterilize the solution as described in the previous step. 
Store at +4 °C, in the dark (see Note 5). 


. High-fidelity PCR Amplification System (e.g., Pfu DNA 


polymerase). 


. QuikChange II XL Site-Directed Mutagenesis Kit. 

. GeneMorph II Random Mutagenesis Kit. 

. Restriction Enzymes. 

. Commercial PCR purification Kit (e.g., QIAquik PCR 


Purification Kit). 


. Commercial DNA gel extraction kit (e.g., Wizard SV Gel and 


PCR Purification Kit). 


2.6.2 PCR Primers 


2.7 Light Irradiation 


7. 


8. 
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Commercial DNA plasmid purification kit (e.g., QIAprep Spin 
Miniprep Kit). 

Commercial yeast transformation kit: e.g., the Frozen-EZ 
yeast transformation IT kit (Zymo Research). 


Primer characteristics for each application: 


1. 


1 


Overlap extension PCR (OE-PCR): presence of G or C at 5’ 
end; T;, between 64 and 66 °C; the overlapping region with 
the target sequence has to be 15-22 bp long. 


. Terminal Extension PCR (TE-PCR): 5’ extension can be up to 


35 bp long; T},, of region annealing to target sequence has to 
be between 64 and 66 °C. 


. Site directed mutagenesis PCR (SDM-PCR): primer length 


should be between 42 and 45 bp; the desired mutation has 
to be close to the center of the primer sequence; the T,, 
(calculated as per QuikChange IT XL Site- Directed Mutagenesis 
Kit) has to be greater than 78 °C. 


. Error-prone PCR (EP-PCR): the primer binding sites are 


located on the vector sequence at a distance greater than 70 bp 
from desired restriction sites. T,, is between 64 and 66 °C. In 
Error-Prone PCR reactions, the required primers stocks con- 
centration is 250 ng/uL. 


. Functionally effective Blue light: Tri-Star Rebel LED (Luxeon 


#MR-RO0500-20T) containing three Royal-Blue LEDs, mounted 
on 20 mm Tri-Star Saber base: 2730 mW at recommended 
operating current 700 mA; A = 447.5 nm (440-460 nm); beam 
angle 125°. 


. LED Heat Sink: Alpha Heat Sink (Luxeon #N80-20B) with 


convection thermal resistance rating of 2.85 °C/W and 
80 x 80 x 20 mm footprint. 


. LED adhesive tape: thermally conductive, electrically isolating 


and adhesive Bond-Ply 100 pad (Luxeon #LXT-T-12) cut to 
fit the 20 mm Luxeon LED’s Tri-Star base. Operational tem- 
perature from —30 °C to 120 °C; thermal performance 


4.5 °C/W. 


. Functionally ineffective Red light: Tri-Start Rebel LED 


(Luxeon #MR-H2060-20T) containing three Red—Orange 
LEDs, mounted on 20 mm Tri-Star Saber base: 366 mW at 
recommended operating current 700 mA; 4 = 617 nm 
(610-620 nm); beam angle 125°. 


. Safety light: Deep-Orange co-extruded polycarbonate film 


(Rosco Supergel #22) applied to main room light source: RGB 
240:80:0; transmittance ~0% at 4 <540 nm. 
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2.8 Light/Dark The screening is performed in a temperature-controlled chamber 

Differential Screening where culture plates can be either exposed to homogeneous light 

Chamber irradiation or kept in the dark. Figure 2 shows a schematic drawing 
of the chamber, from the side view (Fig. 2a) and from the top 
(Fig. 2b). 


1. Homogenous light irradiation is provided as follows: a metal 
scaffold is built around a 1 m? surface, in order to place 9 
LED assemblies 70 cm above cellular cultures (see Note 6). 
Each LED assembly is composed of a heat sink mounted on 
the top of a Tri-star LED, through thermal conductive adhe- 
sive tape. The thermal sink allows an adequate passive cooling 
system that ensures long-time LED functionality (Fig. 2a). 
LED assemblies are mounted on three metal bars to allow 


Functionally ineffective A 


v “ : {-] Heat sink 
LED assembly f — Rebel Tri-Star LED 


emma Thermal adhesive tape 


White 
panel 


incubated 
petri dishes 


Power 
supply 


LED assembly 


meme White panel 


Structural schaffold 


sevesnensnnne Electrical cables 


e Cable connector 


Fig. 2 Growth chamber for yeast-based light/dark complementation assay: (a) side view of the incubation 
chamber; (b) top view of the workbench. Detailed scheme of LED assembly and LED components is shown at 
the right side of each panel 


3 Methods 


3.1 Preparation 
of S. cerevisiae 
Competent Cells 


3.2 Transformation 
of S. cerevisiae Cells 
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homogenous radiation distribution over the table surface. 
LED assemblies are grouped in series and groups are in paral- 
lel with each other, according to power supply specifications. 
This connection system allows easy removal of a single LED 
in case of technical maintenance (Fig. 2b). White panels 
reflecting irradiated light are placed around the table, in order 
to homogeneously distribute light over the plate surface. A 
space between the panels and the table was left to allow air 
circulation. Moreover, the open conformation of the scaffold 
ensures homogenous temperature conditions all over the 
chamber. 


2. Dark condition is obtained by means of a dark cardboard and 
a black cloth covering the plates and allowing air circulation. 


The selection assay is based on a S. cerevisiae mutant Atrk] Atrk2 
(SGY1528) that cannot grow in low external K*. Expression of 
heterologous K* transport proteins restores growth in low K’. 
Functional synthetic channels were identified by their ability to 
complement the yeast growth phenotype in low external K*. 


1. Grow SGY1528 cells on YPDA + 100 mM KCI solid medium 
up to 3 days at 30 °C. 

2. Inoculate a single colony in 3 mL YPDA + 100 mM KC] broth 
and grow overnight in 30 °C shaking incubator (see Note 7). 


3. Dilute cell culture 1:50 in YPDA + 100 mM KCI broth and 
grow in 30 °C shaking incubator until ODgo9 is between 0.8 
and 1.0 (see Note 8). 


4. Centrifuge cells at 500 x g for 4 min, and then discard 
supernatant. 


5. Commercially available kits such as the Frozen-EZ yeast trans- 
formation II provide reagents and protocols to obtain competent 
yeast cells with high transformation efficiency. 


6. Aliquot cells in small volumes and store at —80 °C (see Note 9). 


Commercially available kits such as the Frozen-EZ yeast transfor- 
mation II allow for a high yield in yeast transformation 
procedures. 


1. Follow manufacturer’s instruction to transform cells (see 
Note 10). 


2. Plate up to 5 pL cells over 1/3 of a minimal SD medium plate. 
3. Incubate plates up to 3-4 days at 30 °C. 
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3.3 Rational 
and Random 
Mutagenesis 
Strategies 


3.3.1 Rational 
Mutagenesis: Terminal 
Extension PCR (TE-PCR) 


3.3.2 Rational 
Mutagenesis: Overlap 
Extension PCR (OE-PCR) 


Two main strategies were followed to build a synthetic light-driven 
ion channel: first, a rational approach was pursued, which consisted 
in connecting the sensor to the pore module in different positions 
guided by what is known on the gating mechanism of the channel. 
This was followed by a random mutagenesis approach aiming to 
improve the properties of a promising candidate retrieved from the 
previous approach. Rational mutagenesis was performed using a 
combination of different PCR amplification methods. The stan- 
dard thermal cycler program in combination with Pfu DNA poly- 
merase is as follows: step 1, 30 s at 95 °C; step 2 (15-35 cycles), 
10s at 95 °C, 20s at Ti, 1 min per kb of amplicon at 72 °C; step 
3, 5-7 min at 72 °C; step 4, 7 °C for unlimited time. 


TE-PCR extends both termini of a target sequence for further 
sequence manipulation (e.g., restriction enzyme cloning; OE-PCR; 
motif addiction; see Fig. 3a). To set up PCR parameters, only the 
primer region annealing to the target sequence has to be taken into 
account. The reaction is based on a standard PCR protocol based 
on Pfu DNA polymerase, following the standard reagent concen- 
tration and the standard thermal cycler program as per manufac- 
turer’s instructions. 


The OE-PCR allows joining of regions originally separated in the 
native sequence (see Fig. 3b). Two partially overlapping amplicons 
are generated through terminal extension PCR. 

OE-PCR occurs with an initial step using a mixture of these 
amplicons as reciprocal primers; following this step, a standard 
PCR protocol enriches the newly generated construct. 


1. Generate partially overlapping amplicons through TE-PCR 
(see Subheading 3.3.1) using a combination of one TE-PCR 
primer and one OE-PCR primer: the resulting fragments will 
have a common annealing region. 


2. Isolate amplicons running the PCR reaction on 1% Agarose gel 
and purify the desired fragment with any commercially available 
DNA gel extraction kit. 


3. First OE-PCR reaction: mix within a PCR tube 0.75 U Pfu 
DNA polymerase, 1x Pfu polymerase buffer, 200 pM dNTPs, 
40 ng of the longest and partially overlapping amplicon; 1:1 
molar ratio of the shorter overlapping amplicon; make final 
volume up to 50 pL with ultrapure water. 

4. Set up standard PCR protocol, with following modifications: 
Step 2, 15 cycles and T}, at 55 °C; skip step 3, move directly 
to step 4 (7 °C). 

5. Second OE-PCR reaction: add 0.5 pL of 10 pM primers 
(forward and reverse) to allow full amplification of the newly 
generated construct. 


a Terminal extension 
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Fig. 3 PCR-based cloning methods: (a) terminal extension PCR (TE-PCR) workflow; (b) overlap extension PCR 
(OE-PCR) workflow; (c) site directed mutagenesis (SDM-PCR) workflow 


6. Set up standard PCR protocol, with following modifications: 


Ta oe "Cy 


7. Isolate amplicons running the reaction on 1% agarose gel and 
purify the desired fragment with a commercially available DNA 
gel extraction kit. 


3.3.3 Rational 
Mutagenesis: Directed 
Mutagenesis PCR 
(SDM-PCR) 


This procedure introduces amino acid substitutions, insertions, or 
deletions into a target sequence (see Fig. 3c). For this purpose, 
commercial products as QuikChange site-directed mutagenesis kit 
are available. 


1. Perform SMD-PCR as per manufacturer’s guide, with minor 
adaptations: T;, value and elongation time regulation, based on 
the target sequence. 


2. Transform PCR product into E. coli cells, extract and purify 
the obtained plasmid. 


3. Verify the presence of the desired mutation by sequencing. 


3.3.4 Random 
Mutagenesis: Error-Prone 
PCR (EP-PCR) 


EP-PCR randomly mutates a target region by using an error-prone 
DNA polymerase. Commercially available kits such as GeneMorph 
II random mutagenesis are specifically designed for this purpose. 


EP-PCR primers used in this reaction were designed to perfectly 
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match the target plasmid sequence, in order to allow subsequent 
recombination in gap repair (Fig. 4). 


1. The following setup was designed to generate 10-15 random 
point mutations per 1 Kb of target DNA as per instructions of 
the GeneMorph II random mutagenesis kit. Use 50 ng target 
DNA (e.g., 510 ng of 5.5 Kb plasmid containing a 560 bp 
long target sequence) and run the amplification reaction as 
follows: step 1, 95 °C for 2 min; step 2, 95 °C for 30 s, 64 °C 
for 30 s, 72 °C for 1 per 1 Kb amplified, repeated for 30 cycles; 
step 3, 72 °C for 10 min. 


2. Use 2 pL of the PCR product as a template for another EP- 
PCR. Repeat this step twice. 


3. Isolate amplicons running the reaction on 1% agarose gel and 
purify the desired fragment with a commercially available DNA 
gel extraction Kit. 


4. Proceed to yeast transformation and gap repair. 


3.3.5 Random Gap repair allows the generation of libraries of a randomly mutated 

Mutagenesis: Gap Repair clone within the same non-mutagenized vector background 
(see Fig. 4). High efficiency yeast transformation is recommended. 
Gap repair transformation requires an open plasmid, with terminal 
regions annealing to the terminal regions of the mutagenized frag- 
ment that has to be inserted (see Note 11). 


RE2 asain > —— 
EP-PCR library 


—_ 
P2 
Input clone 


o oa 
a QA 


Open vector 


Gap Repair 


vy 


% Random mutation © ¥ 


Yeast library 


Fig. 4 Gap repair-based library preparation in yeast. RE stands for restriction site; P stands for EP-PCR primer 


3.4 Light-Driven 
Functional 
Complementation 


3.4.1 Low-Throughput 
Screening by Drop-Test 
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1. To prepare the vector for a gap repair reaction, prepare a 
restriction digest of the plasmid by mixing in a tube 10 pg of 
circular pYES2-Met25 expression plasmid, 50 U of each 
restriction enzyme, 1x reaction buffer according to restriction 
enzyme manufacturer’s instructions. Make final volume up to 
50 pL with water (see Note 12). 


2. Incubate at the appropriate temperature following restriction 
enzyme manufacturer’s instructions for more than | h 30 min. 


3. Isolate digested fragments running the reaction on 1% Agarose 
gel and purify the linearized plasmid with any commercially 
available DNA gel extraction kit. 


4. To perform a gap repair reaction, prepare into a 2 mL tube a 
mixture of 500 ng EP-PCR randomly mutated amplicons with 
1 pg of cut yeast expression vector. 


5. Transform the mixture into 50 pL SGY1528 competent cells 
using any commercially available yeast transformation kit, pro- 
viding high transforming rate. When transforming yeast cells 
with Frozen-EZ yeast transformation II kit, eight transforma- 
tion reactions are enough to plate cells onto 36 @ 150 mm 
petri dishes (see Note 13). 


6. Dilute transformed cells with MilliQ water up to 6.5 mL and mix. 


7. Plate 180 pL of diluted cells on 36 @ 150 mm petri dishes 
containing SD-U + 100 mM KCl 


8. Number the plates and provide each with a mark reporting the 
orientation for the following replica plating step. 


9. Incubate at 30 °C in the dark up to 4 days. 


The yeast mutant strain SGY1528 (see Note 14) is suitable for 
functional complementation tests of light-gated synthetic potas- 
sium channels inserted in pYES2-Met25 vector. Functional com- 
plementation is performed on SEL-U-M medium supplemented 
with KCl at a selective concentration. The differential growth 
between light and dark exposed colonies determines which clones 
are functional. Light-driven functional complementation can either 
be performed in low throughput by means of a drop-test assay or 
in high-throughput. 


The drop test method allows fine screening of tens of colonies on 
a single plate by functional complementation. Tenfold serial dilu- 
tions of liquid cell culture are spotted onto solid selective medium 
and grown under light irradiation or in the dark. The differential 
growth rate of the same colony, at the same dilution, at the two 
different conditions, reflects a different ability in complementing 
the defective yeast phenotype. 
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Pick up the desired colony from solid nonselective medium 
(SD-U + A+ 100 mM KCl) and inoculate it in 3 mL of liquid 
nonselective medium. Grow cells overnight in 30 °C shaking 
incubator in the dark. 


. Centrifuge 2 mL of the overnight cell culture at 2700 x g for 


2 min and discard supernatant. 


. Resuspend pellet with 2 mL sterile water to wash cells. 


. Centrifuge cells at 2700 x g for 2 min and discard the 


supernatant. 


. Add 1 mL sterile water and resuspend cells. 


. In a different tube, transfer 50 pL resuspended cells and add 


950 pL water (1:20 dilution). 


. Measure ODo69 of the 1:20 dilution and calculate the OD¢o9 of 


the resuspended cells. 


. Dilute the resuspended cells to OD609 = 0.8 with water. 


. Prepare tenfold serial dilutions in 2 mL tubes: transfer 100 pL 


cell from ODgo9 = 0.8 preparation in a new tube and add 
900 pL water (1:10 dilution). After mixing, repeat the same 
procedure starting from 1:10 dilution to obtain 1:100 dilution 
and from the 1:100 dilution to obtain 1:1000 dilution. 


Spot 7 pL drops of each dilution onto plates of SEL-U-M 
medium, supplemented with KC] at the desired selective con- 
centration. Prepare each plate in duplicate to test simultane- 
ously two functional conditions. 


Seal plates with parafilm and incubate inverted for 3 days at 
30 °C under light or dark conditions in the chamber described 
in Subheading 2.8. 


To purify the plasmid from selected colonies, pick the desired 
colony from solid nonselective medium and inoculate in 3 mL 
of liquid nonselective medium. 


Grow cells overnight in a shaking incubator at 30 °C in the 
dark. 


Purify yeast plasmid with any commercially available plasmid 
extraction kit (see Note 15). 


In a new 2 mL tube transform 10 pL DH5« E. cols competent 
cells with 1 pL of the yeast extracted plasmid. 


Plate 90 pL transformed cells on LB agar plate supplemented 
with selective antibiotics and incubate overnight at 37 °C. 


Inoculate colonies in a 50 mL tube containing 7 mL LB broth, 
supplemented with selective antibiotics; grow cells overnight 
at 37 °C in a shaking incubator. 


Proceed as per any commercially available DNA plasmid puri- 
fication kit workflow. 


3.4.2 High-Throughput 
Screening by Replica 
Plating 


4 Notes 
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This step allows the replication of the same colony over two plates 
with selective medium. The replica plates are then incubated under 
different growing conditions and differentially growing colonies 
identified and selected for further analysis. 


1 


. With a sterile velvet cloth, replicate colonies grown onto each 


nonselective plate onto two different @150 mm plates con- 
taining SEL-U-M+A supplemented with the desired selective 
KCl concentration. 


. Seal and number the two selective plates according to mother 


plate number and selection condition and mark each to report 
the orientation as for the nonselective plate (see Note 16). 


. Incubate plates at 30 °C in the growth chamber. One subset of 


plates must be placed under effective light irradiation, the 
second subset must be grown in the dark (see Note 17). 


. After 3 days, light-irradiated and dark grown plates are compared 


to find differentially grown yeast colonies (see Note 18). 
Colonies that have been identified by differential growth in 
replica plating must be validated to eliminate false positives. 


. To validate the phenotype, identify the colonies that show 


differential growth between light and dark conditions follow- 
ing replica plating. 


. Inoculate the selected colonies in liquid nonselective medium 


(SD-U+A + 100 mM KCl) and purify plasmid from the liquid 
growth. 


. Transform S. cerevisiae competent cells and confirm the pheno- 


type (dark/light differential growth) by drop test. 


. Cover plates with aluminum foil to keep LB agar supplemented 


with ampicillin in the dark. Ampicillin is light sensitive and 
degrades under high temperature and light exposure. 
Moreover, storing plates in an inverted position (with their lid 
down) allows the condensed water to accumulate on the plas- 
tic lid instead of on the medium. 


. Aliquot 1 mL of antibiotics stock solution into 1.5 mL 


Eppendorf tubes. When preparing aliquots of Ampicillin, cover 
tube with aluminum foil to keep solution in the dark. Stock 
solutions are 1000x, when supplemented to LB media add 
1 pL antibiotic to each mL of medium. 


. Dissolve 14.9 g KCI in 100 mL ultrapure water to have a2 M 


stock solution. Add 250 pL, 500 pL, or 2 pL of the obtained 
stock solution to supplement 1 L medium with 0.5 mM, 
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1 mM, or 4 mM KCL, respectively. Weigh 7.4 g of KCl powder 
to prepare 1 L of 100 mM KCl containing medium. 


. Low Potassium Agar from Sigma-Aldrich (#P9338) allows the 


preparation of media supplemented with very low potassium 
concentrations (e.g., 0.5 mM). 


. Filtering vitamins and trace elements through 0.2 pm syringe 


filters ensure sterilization of the solution. Filter under laminar 
airflow hood directly into sterile 50 mL falcon tubes. Cover 
the tubes with aluminum foil to protect from light irradiation. 
Open tubes only in sterile environment. 


. Nine assemblies of Royal-blue Tri-Star Rebel LED, at 


2730 mW each, are able to irradiate | sq. m table at an average 
light distribution (A = 447 nm) of 160-230 pW/cm?. 


. Inoculate 3 mL of cell culture in a 15 mL tube. 


. This process typically requires about 4 h when applying 1:50 


dilution and about 3 h when working with 1:25 dilution. 


. Allow cells to cool down slowly by placing tubes into a polysty- 


rene box, within the —80 °C refrigerator. This procedure 
allows gradual freezing of the cells and improves their transfor- 
mation efficiency. 


Frozen-EZ Yeast Transformation II kit workflow adaptation: 
prepare in a2 mL tube a mixture of 1 pL (>200 ng/ul) expres- 
sion vector, 5 pL SGY1528 competent cells, 50 pL EZ3 
solution. Incubate cells at 30 °C for 45 min and mix the solu- 
tion every 15 min. 


The product of three consecutive EP-PCR reactions is typically 
enough to perform a gap repair transformation plated on up to 
18 petri dishes (9150 mm). 


Cut the plasmid by restriction digestion leaving sticky and blunt 
end opposite each other to avoid plasmid re-circularization. 
Lacking restriction sites can be inserted by SDM-PCR. 


This number of plates occupies roughly 1m? of an irradiated 
area during functional complementation tests. Depending on 
the cell density per plate, it can provide about 20,000 growing 
colonies per functional screening. 


SGY1528 is a K* uptake defective yeast strain: Mata Ade2-1 
canl-100 his3-11,15 leu2-3,112 trpl-1 ura3-1 trkl::HIS3 
trk2::TRP1 [8]. 

When using Zymoprep Yeast plasmid Miniprep II kit a typical 
purification from 2 mL cell culture has an average yield between 
20 and 120 ng/pL. However, the purity of this extraction is not 
suitable for sequencing, for this reason a further cloning step in 
E. coli cells and subsequent plasmid purification are necessary. 
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Seal plates with parafilm to protect cell culture from contami- 
nation and drying. Orientation mark will help to identify the 
original colony on the mother plate. 


Effective irradiation to allow yeast functional complementa- 
tion of BLINK light-driven potassium channel [5]: 
210 + 20 pW/cm? using Royal Blue Light (A = 447 nm). 


The average yield of differentially growing colonies obtained is 
up to 20 per plate. 
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Chapter 18 


Primer-Aided Truncation for the Creation of Hybrid 
Proteins 


Robert Stabel, Birthe Stiiven, Robert Ohlendorf, and Andreas Moglich 


Abstract 


Proteins frequently display modular architecture with several domains and segments connected by linkers. 
Proper protein functionality hinges on finely orchestrated interactions among these constituent elements. 
The underlying modularity lends itself to the engineering of hybrid proteins via modular rewiring; novel 
properties can thus be obtained, provided the linkers connecting the individual elements are conducive to 
productive interactions. As a corollary, the process of protein engineering often encompasses the genera- 
tion and screening of multiple linker variants. To aid these steps, we devised the PATCHY method (primer- 
aided truncation for the creation of hybrid proteins) to readily generate hybrid gene libraries of predefined 
composition. We applied PATCHY to the mechanistic characterization of hybrid receptors that possess 
blue-light-regulated histidine kinase activity. Comprehensive sampling of linker composition revealed that 
catalytic activity and response to light are primarily functions of linker length. Variants with linkers of 77 
residues mostly have light-repressed activity but those with 77 + 1 residues mostly have inverted, light- 
induced activity. We further probed linker length in the context of single residue exchanges that also lead 
to an inversion of the signal response. As in the original context, activity is only observed for certain peri- 
odic linker lengths. Taken together, these results provide mechanistic insight into signaling strategies 
employed by sensory photoreceptors and sensor histidine kinases. PATCHY represents an adequate and 
facile method to efficiently generate and probe hybrid gene libraries and to thereby identify key determi- 
nants for proper function. 


Key words DNA library, Hybrid gene, Light-oxygen-voltage, Protein engineering, Sensor histidine 
kinase, Sensory photoreceptor, Signal transduction 


1. Introduction 


Proteins of diverse biological roles consist of multiple modules, 
often corresponding to distinct protein domains, that are con- 
nected through linker segments [1]. Proper function generally 
depends on productive and precisely calibrated interactions 
among these modules, as facilitated by the intervening linkers [2]. 
Depending upon protein context, linkers differ in a number of 
parameters including length, sequence, surface charge, structure 
and flexibility. Given their eminent role in mediating interactions 
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between protein modules, linkers are also decisive in the engineering 
of proteins with novel function. In particular, desired traits can 
often be obtained by covalently connecting different modules via 
suitable linkers and by thus generating hybrid (or, chimeric) pro- 
teins. Some cases call for flexible linkers that bring modules into 
spatial proximity, but allow reorientation and relative movements; 
in other cases, rigid connectors are required to fix modules at dis- 
crete distances and defined angular orientations. Additional con- 
siderations in linker design include proper folding, intracellular 
trafficking, chemical and biological stability of resultant hybrid 
proteins [3]. Structural information, where available, and multiple 
sequence alignments often provide valuable clues as to which linker 
suffices for a specific engineering purpose. Nonetheless, a priori it 
is difficult to select among many possible linker variants the one(s) 
best suited for meeting the above criteria. As a corollary, often 
multiple linkers are constructed and empirically tested for best 
performance, which incurs considerable expenditure of time and 
effort. 

These concepts are exemplified in signal receptors and their 
engineering. In particular, sensory photoreceptors constitute the 
group of signal receptors that impart sensation of light [4]. 
Photosensor modules absorb light of appropriate quality and in 
response modulate the biological activity of effector (or, output) 
modules. Across different photoreceptor classes, the linkers con- 
necting these modules are often of a-helical or coiled-coil confor- 
mation; modifications to the linker as confined as exchange, 
addition or deletion of single residues can profoundly affect recep- 
tor activity and regulation [5-7 ]. As a case in point, we constructed 
the photoreceptor YF1 by recombining the blue-light-responsive 
light-oxygen-voltage (LOV) photosensor module of Bacillus sub- 
tilis YtvA (BsYtvA) with the Bradyrhizobium japonicum FixL 
( ByFixL) histidine kinase effector [8] (Fig. 1). The crystal structure 
of homodimeric YF1 in its dark-adapted state showed its two LOV 
photosensor modules to be connected to the effector module via a 
parallel coiled-coil linker, denoted Ja [9]. Length variations of Ja 
revealed heptad (i.e., seven-residue) periodicities of catalytic activ- 
ity and regulation by light [8]. The original YF] construct derived 
almost its entire linker from the parental protein ByFixL, but 
equally one could have used the corresponding linker of the other 
parental protein BsYtvA, or hybrids of both linkers. Given that the 
parental linkers are 23 and 27 residues long, there are (23 + 1)-(27 
+ 1) = 672 possible combinations for connecting BsYtvA and 
BjFixL if one restricts hybrid fusions to these linker segments 
(Fig. 1). Although comprehensive interrogation of all linker com- 
binations could provide invaluable insight into signaling mecha- 
nisms and engineering principles, manual construction and separate 
testing of each individual variant is prohibitively cumbersome. We 
hence sought to assess all possible linker variants in parallel. To this 
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Fig. 1 (a) Domain architecture of BsYtvA, BjFixL, and YF1. The hybrid YF1 consists of the BsYtvA LOV domain 
fused to the B/FixL histidine kinase, where almost the entire linker derives from B/FixL. (b) The crystal structure 
(PDB entry 4GCZ [9]) of homodimeric YF1 in its dark-adapted state shows the coiled-coil linker between the 
LOV photosensor and histidine kinase effector modules. (c) Schematic of the possible linker combinations for 
hybrids between the BsYtvA photosensor and the BjFixL effector modules 


end, we established the PATCHY (primer-aided truncation for the 
creation of hybrid proteins) method for the efficient generation of 
hybrid gene libraries with defined composition [10]. 

Prior to this, alternate strategies for creating hybrid gene librar- 
ies had been proposed and successfully applied for the construction 
of novel signal receptors, in particular the methods SHIPREC 
(sequence homology-independent protein recombination) and 
ITCHY (incremental truncation for the creation of hybrid pro- 
teins) [11-14]. However, for fragmentation of the parental genes, 
these methods rely on endonucleolytic or exonucleolytic DNA 
cleavage, both of which are challenging to titrate. Moreover, it is 
inherently difficult to confine hybrid fusions to specific regions of 
the parental genes such as their linker regions. Taken together, 
these limitations have hampered the wider application of either 
method [15]. In contrast, PATCHY circumvents problems arising 
from nucleolytic DNA fragmentation by means of PCR amplifica- 
tion with sets of staggered primers to truncate the parental genes 
and thus produce linker libraries of defined composition (Fig. 2). 
As detailed in this chapter (see Subheading 3.1), PATCHY starts 
from a template construct in which two parental gene fragments 
are concatenated such that they are separated by a nucleotide 
stretch that introduces a frameshift and a unique restriction site. 
The template is then amplified in a one-pot PCR reaction with sets 
of forward and reverse primers. Both primer sets are staggered in 
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Fig. 2 The PATCHY template construct contains a tandem fusion of the longest 
desired fragments of genes A and B on a circular plasmid (step 7). PATCHY uses 
staggered primers to create linear DNA molecules bearing versions of genes A 
and B that are terminally truncated at certain positions corresponding to the 
primer annealing sites (step 2). The PATCHY hybrid gene library is created through 
phosphorylation and blunt-end ligation to yield circular plasmids (step 3) 


increments of three nucleotides such as to truncate the template 
construct in the register of nucleotide triplets which corresponds 
to single amino acids at the protein level. The resultant linearized 
plasmid fragments are phosphorylated and religated to produce a 
library of circular plasmids. Optionally, remnants of the template 
construct can be depleted from the library by restriction digest. 
Plasmid libraries are then transformed and screened for favorable 
phenotypes (see Subheading 3.2). 

We applied PATCHY to the fusion of the BsYtvA LOV photo- 
sensor and the BjFixL effector module that previously yielded the 
blue-light-repressed histidine kinase YF1 [8], see above. To facili- 
tate library screening, hybrid gene libraries were constructed in the 
context of the pDusk- DsRed reporter plasmid [16] which encodes 
YF1, the cognate response regulator ByFixJ and the DsRed fluores- 
cent reporter under control of the ByFixkK2 promotor. In the dark, 
YF1 phosphorylates ByFixJ which in turn binds to the ByjFixk2 
promotor and thereby ramps up the expression of the reporter 
gene. Under blue light, YF] acts as a net phosphatase on ByjFixJ 
which in turn results in about 10-15 fold decreased DsRed expres- 
sion compared to in the dark (Fig. 3). Fluorescence-based screen- 
ing (see Subheading 3.3.1) of the PATCHY library revealed that 
photoreceptor activity and regulation by light are by and large 
determined by linker length with a pronounced heptad (seven- 
residue) periodicity. Insertion of single residues sufficed to invert 
the response to blue light. An inversion of the light response of 
YF1 can not only be effected via linker-length modifications but 


2 Materials 


2.1 Chemicals 
and Consumables 
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Fig. 3 The pDusk-DsRed reporter plasmid. In the absence of blue light, YF1 phos- 
phorylates the response regulator BjFixJ which binds to the ByFixK2 promotor 
and thereby upregulates reporter gene expression. Blue light inhibits expression 
of DsRed by around 10-15-fold as it converts YF1 to a net phosphatase [16] 


also via the introduction of the single amino-acid exchanges D21V 
or H22P within the LOV photosensor module [9, 17]. To obtain 
a better mechanistic understanding of signal transduction and 
modulation of the signal response, we applied PATCHY to the 
systematic interrogation of linker determinants in the background 
of either residue exchange (see Subheading 3.3.2). Similar to the 
results for YF1, activity was only observed in discrete heptad regis- 
ters, 1.e., 7m for H22P, as well as 77 and 7” + 1 for D21V. In one 
variant, elongation of the linker by three residues counteracted the 
inversion of the signal response originally caused by the D21V 
exchange. 


. 5x HF Buffer for PCR. 

. 2 U/pL Phusion High-Fidelity DNA Polymerase. 
. 10 mM dNTPs each. 

. 10 pM total forward primer pool for PATCHY. 

. 10 pM total reverse primer pool for PATCHY. 

. 10 U/pL Nobel restriction enzyme. 

. 10 U/pL Don restriction enzyme. 

. 10 U/L T4 Polynucleotide Kinase. 

. 50% PEG-4000 (w/v). 

. 0.5 mM ATP. 

. 30 U/pL T4 DNA Ligase. 

. 50x TAE buffer: 2 M Tris-Acetate, 50 mM EDTA, pH 8.5. 
. 1% Agarose in 1x TAE Buffer. 


. LB/Kanamycin medium: 10 g peptone, 5 g yeast extract and 
10 g NaCl in 1 L dH,O supplemented with 50 mg/L 
Kanamycin. 
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2.2 Lab Equipment 


2.3 Software 


3 Methods 


15. 


16. 
17. 
18. 


19. 


20. 


21. 


1. 


10. 


LB/Kanamycin agar plates: LB/Kanamycin medium supple- 
mented with 20 g/L agar. 


Chemically or electrocompetent Escherichia coli cells. 

50% glycerol (w/v). 

NucleoSpin Extract Kit for purification of PCR products 
(Macherey Nagel). 

96-deep-well microtiter plates for growth of bacterial clones 
(e.g., Axygen). 

Clear 96-well microtiter plates for absorption measurements 
(e.g., Nunc). 

Black 96-well microtiter plates for fluorescence measurements 
(e.g., Nunc). 


Gradient thermal cycler for PCR amplification (e.g., Thermal 
Cycler $1000, Bio-Rad). 


. Electrophoresis chamber (e.g., Wide Mini-Sub Cell GT Cell, 


Bio-Rad). 


. Nanodrop  spectrophotometer (e.g., Spark 10 M_ with 


Nanoquant plate, Tecan). 


. Microplate reader with absorption and fluorescence optics 


(e.g., Infinite M200 pro, Tecan). 


. Two incubators (e.g., Incu Line IL10, VWR). 
. Two shakers for microtiter plates (e.g., PMS-1000i, Grant). 
. Blue-light LED array, custom built, 10 x 8 LEDs of 470 + 


10 nm (Winger Electronics). 


. Lamp power meter (model 842-PE, Newport) with silicon 


photo detector (model 918D-UV-OD3, Newport). 


. Laser safety goggles for fluorescence-based screening on agar 


plate (Roithner Lasertechnik, 550 nm long pass). 


Optional: Flow cytometer with sort functionality (e.g., S3e, 
Bio-Rad). 


. Python script for design of staggered primers: https://github. 


com/vrylr/PATCHY. git 


The generation and analysis of PATCHY hybrid gene libraries are 
described in Subheadings 3.1 and 3.2, respectively. To illustrate 
the general method and individual steps, by way of example we 
repeatedly refer to a recent study in which we applied PATCHY to 
recombine the BsYtvA photosensor with the ByFixL effector module 
(see Subheading 1) [10]. Key results from these experiments are 


3.1. Generation 
of PATCHY Libraries 


Generation of Hybrid Gene Libraries 293 


summarized as a case study in Subheading 3.3.1. The application 
of PATCHY to the same two modules but with either the residue 
exchange D21V or H22P within the photosensor is covered as a 
second case study in Subheading 3.3.2. 


1. Principal considerations: In a one-pot reaction, PATCHY gen- 
erates defined libraries of hybrid genes with single fusion sites 
between an upstream gene A and a downstream gene 
B. Individual library members differ in which fragments of the 
parental genes A and B they comprise. As a first step, the 
desired composition of the library is specified by deciding 
which set of gene fragments of A and B are to be recombined. 
In the case study, we constructed hybrid variants that connect 
the entire BsYtvA LOV photosensor module including a vari- 
able number 2 of residues of its C-terminal linker (2 € {0, 1, ..., 
23}) to the entire ByFixL effector module including a variable 
number 7 of residues of its N-terminal linker (7 € {0, 1,..., 27}) 
(see Figs. 1 and 2). 


2. PATCHY template construct: PATCHY libraries are obtained 
by PCR amplification of a template construct with staggered 
primer sets. The template construct contains a hybrid fusion 
between the longest desired fragments of each of genes A and B. 
A spacer sequence that harbors a unique restriction site and 
deliberately introduces a frameshift is inserted between the two 
gene fragments. In the case study, the BsYtvA LOV photosen- 
sor including its entire C-terminal linker (2 = 23) was con- 
nected to the ByFixL effector including its entire N-terminal 
linker (j = 27). The intervening spacer sequence encoded a 
frameshift and a unique Néel restriction site. To facilitate sub- 
sequent analysis and screening of PATCHY libraries (see 
Subheading 3.2), the template construct was assembled in the 
background of the pDusk-DsRed reporter plasmid [16] which 
affords facile fluorescence readout of receptor activity, see 
Subheading | and Fig. 3. 


3. Primer design: Sets of forward and reverse oligonucleotide 
primers are devised such that during the PATCHY PCR reac- 
tion they lead to incremental truncations of the template con- 
struct at the 3’ end of the upstream gene A and at the 5’ end 
of the downstream gene B (see Fig. 2). For this purpose, both 
sets are staggered in increments of base triplets, corresponding 
to single amino acids at the protein level. To facilitate the 
PATCHY PCR reaction, the primers should be designed with 
largely uniform melting temperatures (T),), ideally all within 
+1 °C. Primers can either be designed manually or in auto- 
mated manner with a Python script (see Note 1). All forward 
primers are pooled at equimolar ratios, and the total concen- 
tration is adjusted to 10 uM; likewise, a pool of all reverse 
primers at 10 1M total concentration is prepared (see Note 2). 
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Table 1 

PATCHY PCR reaction mix 
Reagent Quantity 
5x HE buffer 10 pL 
10 mM dNTP mix 1 pL (0.2 mM each) 
DNA template 1 pL (50 pg-1 pg) 
10 uM total fwd. primer pool 1.25 pL (0.25 pM) 
10 uM total rev. primer pool 1.25 pL (0.25 pM) 
2 U/uL Phusion polymerase lpL 
ddH,O Add to 50 pL 

Table 2 

PATCHY PCR program 
Step Temperature Time 
1 9p 30s 
2, WH 30s 
3 Tn-5 °C 30 s 
4 72°C 30 s/kb 
5 go to step 2 35 cycles 
6 De 10 min 
W 10°C oo) 
4, PATCHY PCR reaction: The PCR reaction mixture is pre- 


pared according to Table 1, and PCR amplification is con- 
ducted as described in Table 2 where the annealing temperature 
is set at T,,-5 °C. Analysis of the reaction products by agarose 
gel electrophoresis should show one dominant or even a single 
DNA band of the expected size. For 7 forward and m reverse 
primers, the PCR product should theoretically comprise 7 » m 
different, incrementally truncated, linear PCR fragments. 


. Workup of PATCHY PCR reaction: The product of the PCR 


reaction is purified using standard molecular biology Kits (e.g., 
NucleoSpin Gel and PCR clean up, Macherey-Nagel). To 
achieve high DNA concentration for subsequent steps, the 
PCR product should be eluted with 30 pL ddH,0O or elution 
buffer. The concentration is determined spectrophotometri- 
cally or by analysis via gel electrophoresis and comparison to a 
standard of known concentration. Ideally, one should have a 
total of at least 1-2 pg DNA at this stage (see Note 3). 
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Table 3 
Phosphorylation of linear DNA fragments 


Reagent Quantity 
10x T4 DNA ligase buffer Sis) (WIE, 
Linear PCR fragment 30 pL (50 pg-1 pg) 
10 U/pL T4 polynucleotide linase 2 pL 
Table 4 


Ligation of phosphorylated linear DNA fragments 


Reagent Quantity 
Phosphorylation reaction mix 35 pL 
50% PEG-4000 (w/v) 4 uL 
0.5 mM ATP (optional) 0.5 pL 
30 U/uL T4 DNA ligase 1 pb 


6. Optional: To deplete the template construct from the reaction 
mixture, the PCR product is alternatively purified via agarose 
gel extraction. Depending upon the plasmid size, the circular, 
negatively supercoiled template can show different electropho- 
retic mobility on agarose gels from the desired linear PCR 
products. It is hence often possible to separate the original 
template construct from the PATCHY library (see Note 4). 


7. Phosphorylation of PCR product: The PCR product is phos- 
phorylated at the 5’ end by polynucleotide kinase according to 
Table 3. The reaction mixture is incubated at 37 °C for 30 min. 


8. Circularization of linear PCR product: Ligation of the phos- 
phorylated, linear PCR products yields circular plasmids and is 
carried out in the same reaction solution as phosphorylation. 
The reaction mix is allowed to cool to 22 °C before T4 DNA 
ligase, PEG-4000, and ATP (optional) are added (Table 4). 
The reaction mix is then incubated at 22 °C for 1 h (see Note 5). 


9. Optional—Depletion of template construct from PATCHY 
library: The original template construct is selectively depleted 
from the PATCHY library via restriction digest with the 
enzyme that cuts at the deliberately introduced unique site 
within the spacer sequence of the template (see step 2). For the 
example case of the PATCHY library between the BsYtvA LOV 
sensor and the ByFixL effector, 1.5 pL Nhel was added, fol- 
lowed by incubation at 37 °C for 30 min. To prevent recircu- 
larization, 1.5 pL of the phosphatase FastAP was added and 
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3.2 Analysis 
of PATCHY Libraries 


10. 


11. 


incubation continued for 15 min at 37 °C. Enzymes were inac- 
tivated by incubation at 75 °C for 15 min (see Note 4). 


Optional — Depletion of template construct from PATCHY 
library: Alternatively, the template construct can selectively 
be digested with DpnI which cleaves DNA in methylation- 
dependent manner (see Note 4). 


Transformation of PATCHY library: The PATCHY library can 
directly be transformed into Escherichia coli cells (e.g., 
DH10b). For example, 5-10 pL of the reaction mix were 
transformed into chemically competent cells, or up to 1.5 pL 
were transformed into electrocompetent cells. After plating on 
LB agar and overnight incubation at 37 °C, the library can 
either be processed right away (see Subheading 3.2) or stored 
at —80 °C as a glycerol stock. 


. Sequence analysis of the naive PATCHY library: To assess the 


quality of the PATCHY library, a number of individual clones 
can be analyzed by Sanger sequencing. In case the original 
template construct is found with significant frequency, extra 
efforts should be taken to deplete it from the library (see 
Subheading 3.1, steps 6, 9, and 10 as well as Note 4). 


. Optional: Alternatively, the library can be analyzed by next- 


generation sequencing (NGS). In the case study [10], the 
entire PATCHY plasmid library was fragmented by ultrasound 
and sequenced on an Illumina platform with paired-end 
150-base-pair reads. Although most reads covered the invari- 
ant plasmid backbone, the low cost of NGS still allowed to get 
good coverage of the variant parts of the plasmids. We obtained 
approximately 5500 paired-end reads covering the linker 
region, corresponding to an ~8-fold oversampling of the 
expected 672 different hybrid genes (Fig. 4). Out of these 672 
variants, 578 could be detected by NGS. The data also indi- 
cated that each forward and reverse primer had been used in 
the PATCHY PCR reaction albeit to varying extents. 


. Optional: In case the sequence analysis indicates a nonrandom, 


biased distribution of expected constructs in the PATCHY 
library, one may rerun the PATCHY PCR reaction with 
adjusted relative primer concentrations (see Subheading 3.1, 
steps 3 and 4 as well as Note 2). 


. Optional: To further diversify the PATCHY library, it can be 


subjected to an error-prone PCR reaction (see Note 6). 


. Isolation of hybrid variants from the PATCHY library: The 


identification of hybrid variants with desirable traits in the 
PATCHY library is obviously governed by the assays available 
to read out activity. In principle, individual clones can be iso- 
lated and analyzed separately, but ideally functionality can 
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linker length / B/FixL 


linker length i BsYtvA 


Fig. 4 Next-generation sequencing of the PATCHY library of hybrids between 
BsYtvA photosensor and BjFixL effector. The x- and y-axes denote the number of 
residues / and / from the linkers of the parental BsYtvA and BjFxL that a given 
variant contains. Grey squares indicate theoretically expected constructs indeed 
observed in the library. Each primer was used in the PATCHY-PCR reaction 


directly be assessed with high throughput and in parallel in 
vivo, i.e., without laborious intermediate steps such as protein 
purification. Efficient in vivo assays largely fall into two clades: 
first, screening assays are based on a readily measurable, often 
colorimetric readout such as fluorescence; second, selection 
systems link functionality of the hybrid variants to cell survival 
or proliferation. In the example case of the chimerae between 
BsYtvA LOV and the ByFixL histidine kinase, we resorted to 
the pDusk-DsRed screening system which links hybrid recep- 
tor activity to expression of a DsRed fluorescent reporter gene 
[10, 16]. Transformed cells are cultured at 37 °C on plate in 
darkness or under constant blue light illumination (470 nm, 
40 pW/cm7?). Clones harboring functional hybrid variants that 
expressed DsRed under the given incubation condition are 
visually identified via observation of fluorescence through a 
550-nm long-pass filter goggle. As detailed in Subheading 3.1 
step 1, the template construct is incapacitated by a deliberate 
frameshift between the two gene fragments recombined via 
PATCHY. Screening for function is thus well suited to further 
discriminate against the template construct (see Note 4). 


6. Optional: As an alternative screening method, fluorescence- 
activated cell sorting can be used, provided function of hybrids 
is linked to a fluorescent readout [16] (see Note 7). 


7. Storage of isolated hybrid variants: Clones with desirable phe- 
notypes identified in the previous two steps are grown at 37 °C 
to stationary phase. To this end, single wells of a 96-deep-well 
microtiter plate are filled with 600 pL LB medium supple- 
mented with the required antibiotic and are inoculated with a 
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3.3 PATCHY Case 
Studies 


3.3.1 Linker 
Libraries of YF1 


single clone. For long-term storage, 200 pL of the saturated 
cultures is mixed with 200 pL 50% (w/v) glycerol and frozen 
at —80 °C. 

8. Quantitative characterization of isolated hybrid variants: Using 
suitable functional assays, the biological activity of the hybrid 
variants isolated in Subheading 3.2 step 5 should be character- 
ized in more detail. In the example case, replicates of each 
clone were inoculated in 5 mL LB/Kanamycin and grown at 
37 °C for 16 hin darkness or constant blue light (470 nm, 100 
uW /cm7) (see Note 8). Saturated cultures were diluted tenfold 
in ddH,O, and ODgo9 was measured in clear microtiter plates 
with a microplate reader. Samples were further diluted fivefold 
in ddH,0, and DsRed fluorescence was measured in black 
microtiter plates with settings of 554 + 9 nm for excitation and 
591 + 20 nm for emission, respectively. Fluorescence readings 
were normalized to the ODsoo of the corresponding sample. 


9. Sequence analysis of isolated hybrid variants: Clones that 
showed desirable properties in the previous step are analyzed 
by Sanger DNA sequencing (see Note 9). 


We applied PATCHY to better characterize the underlying design 
principles and signal-transduction mechanisms in the chimeric 
photoreceptor YF1 [8]. The original YF1 receptor originated from 
the fusion of the blue light-sensitive light-oxygen—voltage photo- 
sensor module of BsYtvA to the effector module of the ByFixL 
histidine kinase. Notably, in YF1 the linker between these modules 
essentially derived from the ByFixL parental protein, 1.e., 7 = 3 and 
j = 25 (see Subheading 3.1 step 1). Sparse sampling of linker 
composition had earlier identified linker length as the main deter- 
minant for activity and regulation of the hybrid receptors [8]. A 
seven-residue periodicity of the dependence of activity and regula- 
tory properties on linker length resulted from the coiled-coil con- 
formation of the linker, as later evidenced in the high-resolution 
structure of dark-adapted YF1 [9]. Notably, only a tiny fraction of 
many conceivable hybrid variants were studied at this point [8]. 
Despite these biochemical and structural data, the mechanism 
by which signals are transduced from the LOV photosenor to the 
histidine kinase effector remained unclear. We reasoned that com- 
prehensive interrogation of linker sequence space could yield addi- 
tional mechanistic insight. The linkers between the respective 
sensor and effector modules in the parental receptors BsYtvA and 
BjFixL comprise 23 and 27 residues, respectively. Provided that 
fusions between the parental proteins are restricted to these linker 
regions, there are (23 + 1) ¢ (27 + 1) = 672 different ways to 
recombine the BsYtvA LOV photosensor with the ByFixL effector 
(see Figs. 1 and 2). Using PATCHY as described in Subheading 
3.1, we generated a construct library that theoretically contains 
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a Helical register m b Helical register m 
2 6 3 0 4 1 5 2 6 3 0 4 1 5 


2 6 3 0 4 1 5 


Fig. 5 Linker-length analysis of light-regulated BsYtvA-BFixL hybrid variants. (a) For YF1, light-repressed vari- 
ants (filled circles) showed periodic linker lengths of 7n or 7n + 5 residues but light-activated variants (open 
circles) mostly occurred at linker lengths of 7n + 1 residues. (b) Light-activated D21V variants had linkers of 
7n or 7n + 1 residues. Two variants with linkers belonging to the 7n and 7n + 3 classes showed inverted, 
light-repressed response. (c) For H22P, only light-activated variants were identified that predominantly pos- 
sessed linkers of 7n residues 


these 672 linker variants. Library construction was performed in 
the context of the pDusk-DsRed reporter plasmid [16] that affords 
rapid screening for functional hybrid variants as described in 
Subheading 3.2 (see Fig. 3). 

Key results from the analysis of the PATCHY library are sum- 
marized in the following and in Fig. 5a. Strikingly, light-regulated 
activity in BsYtvA-BjFixL hybrids is supported by quite different 
linker lengths ranging from 4 to 50 residues, whereas the original 
YF1 possesses a linker of 28 residues. Assuming a continuous 
a-helical coiled-coil structure of the linker elements, this translates 
into distances of separation of ~6-75 A between LOV photosen- 
sor and effector modules. In addition to hybrids with light- 
repressed activity as in YF1, several variants showed an inverse 
response to light, i.e., they had light-induced activity. Hybrid vari- 
ants with alike responses to light usually displayed a pronounced 
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3.3.2 Linker Libraries 
of Signal-Inverted YF1 


heptad periodicity in their linker lengths. Whereas light-repressed 
hybrid variants predominantly possessed linkers of 77 and 7n + 5 
residues, those with light-induced activity mostly had linkers of 7 
+ 1 residues (” € {1, ..., 7}). The heptad dependence on linker 
length can be accounted for by the coiled-coil nature of the linker. 
Sequence analysis of a large group of related, natural sensor histi- 
dine kinases revealed their sensor and effector modules to be 
mainly connected by coiled-coil linkers of length 7” and 7” + 5 
residues [10]. 

The difference in the preferred linker lengths for light-repressed 
and light-activated hybrid variants provides clues as to the mecha- 
nistic bases for signal transduction in YF1. Notably, the registers 
7n and 7n + 1 differ by insertion of a single residue; within a 
canonical coiled coil such an insertion induces an angular shift of 
around 100° [18]. We thus propose that signal transduction in 
YF] is predicated on angular reorientation of the LOV photosen- 
sor relative to the histidine kinase effector modules [8, 9, 19, 20]. 
This type of angular reorientation could also be brought about by 
a change in the degree of supercoiling of the linker. We recently 
investigated light-dependent signal transduction in YF] by elec- 
tron paramagnetic resonance (EPR) spectroscopy and X-ray solu- 
tion scattering [19, 20]. Indeed, the two independent and 
complementary approaches identify light-induced left-handed 
supercoiling as the mode for signal transduction and thereby pro- 
vide a structural rationale for the PATCHY data. 


The inversion of the response to light in YF1 cannot only be 
achieved by linker variations but also by the replacement of certain 
residues within the LOV photosensor units [9, 17]. To investigate 
the basis for this profound effect on signal transduction, we 
repeated the PATCHY analysis of the BsYtvA-BjFixL chimerae in 
the background of either the D21V or H22P exchange, both of 
which induce inversion of the light response in YF1. 

In case of D21V, hybrid variants with the same qualitative sig- 
nal response as the original YF] D21V, i.e., with an increase in 
activity upon blue-light exposure, fell into two distinct classes with 
linkers of 7m and 7n + 1 residues, respectively (Fig. 5b). The pref- 
erence for these classes is indicative of the coiled-coil structure of 
the linker in the D21V variants. Interestingly, in the YF] context 
the clades 7m and 7n + 1 were associated with light-repressed and 
light-induced activity, respectively, whereas in case of D21V they 
both predominantly gave rise to light activation. However, two 
D21V variants with linkers of 31 (= 7 ¢ 4 + 3) and 42 (= 7 « 6) resi- 
dues displayed light-repressed activity; in particular, in the former 
construct, DsRed reporter fluorescence was enhanced by around 
9-fold in darkness versus blue light. Intriguingly, the signal inver- 
sion introduced by the D21V exchange could thus be reverted in a 
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construct with a linker that is elongated by three residues. Currently, 
only two D21V variants with distinctly light-repressed activity 
could be identified, and a more detailed analysis of the underlying 
linker lengths is therefore precluded. Exhaustive sampling of linker 
variants in the D21V context by PATCHY might yield additional 
light-repressed variants and allow this type of analysis. 

For the H22P background, only hybrid variants with light- 
induced activity as the original YF] H22P were found but none 
with light-repressed activity (Fig. 5c). Consistent with a coiled- 
coil conformation, the linkers of these variants predominantly 
comprised 77 residues. A recent EPR investigation [19] revealed 
that introduction of the H22P exchange leads to a complete rear- 
rangement of the dimer interface; two N-terminal « helices that 
in YF] are tucked in between the LOV photosensor domains (see 
Fig. 1) are displaced and probably unfolded in H22P. Given this 
drastic change of the dimer interface, it is perplexing that the 
H22P variant transduces signals, let alone in inverted manner. 
Despite these differences, signal transduction in H22P apparently 
employs a highly similar structural mode as YF1, albeit with 
inverted sign [19]. In support of this view, we now find that the 
linkers of variants with signal response corresponding to that of 
YF1 H22P mostly conformed to 77 residues. By that token, one 
would also expect to find H22P linker variants with inverted, i.e., 
light-repressed activity. Our inability to do so may stem from 
insufficient sampling of linker sequence space or from an inherent 
difference in the signal mechanism of H22P. The above finding 
that the light response of the D21V variant could be inverted for 
certain linker lengths implies that the former is true and that sig- 
nal inversion by linker-length variations also applies to the H22P 
context. 


The PATCHY method offers an efficient route toward libraries of 
hybrid genes with a single fusion between defined fragments of 
two parental genes A and B. In contrast to related approaches for 
the construction of hybrid gene libraries [11-15], PATCHY uses a 
simpler protocol and obviates incremental nucleolytic digest of the 
parental genes which is difficult to precisely adjust. Rather, PCR 
amplification with sets of staggered primers exactly delineates 
which fragments of A and B are generated and recombined. The 
analysis of PATCHY hybrid libraries benefits from efficient activity 
assays, in particular cell-based screening and selection approaches. 

We demonstrate the utility of PATCHY for the test case of 
hybrid blue light photoreceptors. Catalytic activity and response 
to light are largely governed by the length of the linker interven- 
ing the constituent photosensor and effector modules of the 
receptors. A striking seven-residue dependence of receptor func- 
tion on linker length is explained by the continuous coiled-coil 
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4 Notes 


structure of said linker. Insertion of single residues in the linker 
can lead to inversion of the receptor response to blue light [10]. 
Recent mechanistic investigations [19, 20] reveal light-induced 
left-handed supercoiling of the linker and thus provide the struc- 
tural rationale for signal inversion via linker-length variations. In 
the context of the residue exchanges D21V and H22P, each of 
which induces signal inversion in YF], light-regulated function is 
only sustained by linkers of discrete lengths. 

As a method, PATCHY is generally suitable for the construc- 
tion and mechanistic interrogation of hybrid gene libraries. In 
addition to the engineering of signal receptors, PATCHY also 
appears well suited to the construction of fluorescent reporters. 


1. The computer script is available from Github at https: //github. 
com/vrylr/PATCHY. git. To execute it, a Python interpreter is 
required which can be obtained free of charge from http:// 
www.python.org or https://winpython.github.io. 


2. We recommend to initially pool all forward primers at equi- 
molar ratio and use them at a total concentration of 10 1M. 
The reverse primers should be prepared accordingly. Given 
that the individual primers differ in their sequences, they may 
well possess different annealing and amplification efficiencies 
which would result in a biased distribution of hybrid genes. 
Adjustment of the relative primer concentrations may be a 
remedy for this problem. In principle, one can employ non- 
equal concentrations within each primer pool to deliberately 
predispose the PATCHY PCR reaction toward the generation 
of certain sets of hybrid variants. Alternatively, the PATCHY 
protocol can be rerun with subsets of the forward and reverse 
pools that entirely lack certain primers. 


3. One can carry on with the subsequent steps, even if less DNA 
has been obtained at this stage. 


4, We found it useful to implement several steps for removal of 
the initial template construct from the PATCHY library. (a) In 
many cases, the linear PATCHY PCR products can effectively 
be separated from the circular template via gel extraction (see 
Subheading 3.1 step 6). (b) Following phosphorylation and 
circularization of the PATCHY PCR products, the template 
can selectively be degraded via restriction digest with DpzI or 
the specific endonuclease chosen in the template design (see 
Subheading 3.1 steps 9 and 10). (c) Lastly, a deliberate frame- 
shift can be introduced into the template construct to disable 
it. Functional analysis of the PATCHY library, in particular by 
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efficient screening or selection procedures, provides a ready 
means for discrimination against the template construct (see 
Subheading 3.2 step 5). In case the resultant PATCHY library 
still contains an exceedingly high fraction of the initial tem- 
plate construct, steps (a) and (b) can be repeated. 


5. The ligation reaction can alternatively be carried out at 16 °C 
over night. 


6. A particular strength of PATCHY is the generation of hybrid 
genes of defined composition. Although this usually is the 
wanted outcome, in certain scenarios the desired traits may not 
manifest in any of the hybrid gene variants within the PATCHY 
libraries. In these cases, the PATCHY libraries can further be 
diversified via error-prone PCR [21]. Introduction of random 
residue exchanges, in particular within the linker region 
between protein modules, may well yield variants with the 
desired functional properties. 


7. The screening of PATCHY libraries can be expedited with 
fluorescence-activated cell sorting. This is especially useful in 
case of protein switches that are supposed to show differential 
activity in response to a signal. PATCHY hybrid libraries can 
alternately be screened in the absence and presence of this sig- 
nal, and stringently responding variants can thus be enriched, 
identified and isolated. 


8. Alternatively, the activity of the PATCHY variants can be 
assessed in microtiter format. Within a 96-deep-well plate, 
aliquots of 600 pL LB/Kanamycin medium are inoculated 
with individual variants and incubated for 16 h in darkness or 
under constant blue light (470 nm, 40 pW/cm?). All subse- 
quent steps are conducted as described in Subheading 3.2 
steps 8 and 9. 


9. Many companies offer plasmid preparation and Sanger 
sequencing in microtiter format. For larger number of hybrid 
variants, this approach usually is more economic in terms of 
time and money. 


Discussions with C. Engelhard and R. Bittl (Freie Universitat 
Berlin) are appreciated. Financial support through Deutsche 
Forschungsgemeinschaft grant MO2192/4-1 (A.M.), through a 
Sofja-Kovalevskaya Award by the Alexander-von-Humboldt 
Foundation (A.M.), and through Boehringer-Ingelheim Fonds 
(R.O.) is gratefully acknowledged. 
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Engineering Small Molecule Responsive Split Protein 
Kinases 
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Abstract 


The over 500 human protein kinases are estimated to phosphorylate at least one-third of the proteome. 
This posttranslational modification is of paramount importance to intracellular signaling and its deregulation 
is linked to numerous diseases. Deciphering the specific cellular role of a protein kinase of interest remains 
challenging given their structural similarity and potentially overlapping activity. In order to exert control 
over the activity of user-defined kinases and allow for understanding and engineering of complex signal 
transduction pathways, we have designed ligand inducible split protein kinases. In this approach, protein 
kinases are dissected into two fragments that cannot spontaneously assemble and are thus inactive. The two 
kinase fragments are attached to chemical inducers of dimerization (CIDs) that allow for ligand induced 
heterodimerization and concomitant activation of kinase activity. 


Key words Protein kinase, Split kinase, Split protein, Ligand gating, Chemical inducer of dimerization 


1. Introduction 


The activity and function of many, if not all, proteins are regu- 
lated by a multitude of chemical perturbations, collectively 
termed posttranslational modifications (PTMs) [1]. Of these, 
protein phosphorylation may be the most common [2], where 
some estimate that three-quarters of the proteome can be phos- 
phorylated [3, 4]. Protein phosphorylation is catalyzed by a class 
of enzymes known as protein kinases. Protein kinases are involved 
in almost all pathways, from cell division to cell death [5], and 
their aberrant function is implicated in a plethora of diseases 
such as cancer [6], metabolic disorders [7], inflammation [8], 
and neurological disorders [9]. This renders kinases very attrac- 
tive targets for therapeutic intervention [10-12], and protein 
kinase inhibitors are currently the second largest group of thera- 
peutics. Kinase research has advanced tremendously since the 
discovery of the first serine [13] and tyrosine kinases [14], 
respectively. However, deciphering the specific cellular role of a 
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specific kinase remains challenging. Though powerful, genetic 
methods such as siRNA-based genetic knockdown and CRISPR- 
Cas gene editing, which provide insight regarding the function 
of a specific protein kinase, presently lack spatial and temporal 
control and often fall prey to compensatory cellular mechanisms. 
Toward the goal of establishing temporal control of a specific 
protein kinase, three elegant posttranslational control methods 
have been developed that build on structural studies [15-17] 
and seek to either turn a specific kinase on or off. 

Shokat and coworkers pioneered a pharmacological knockdown 
approach by designing inhibitor-sensitive kinase alleles, where 
mutation of the gatekeeper residue adjacent to the kinase ATP- 
binding cleft allowed for inhibition with uniquely complementary 
inhibitors [18-20]. A more recent protein engineering approach 
described by Hahn and coworkers provides allosteric control over 
kinase activity by using a modified FK506 binding protein (FKBP) 
as an insertion motif in a highly conserved region of the catalytic 
domain of a protein kinase, which renders the kinase inactive. The 
subsequent addition of the small molecule rapamycin as well as the 
FKBP12 — rapamycin binding protein (FRB), likely restores cata- 
lytic activity through the reestablishment of the native tertiary 
structure of the disrupted catalytic domain [21, 22]. A third 
approach, pioneered by Chin and coworkers, incorporated a genet- 
ically encoded nonnatural photocaged lysine implicated in catalysis 
that renders the kinase inactive [23]. Irradiation of cells expressing 
the mutant kinase uncages lysine, leading to a gain of kinase activ- 
ity and downstream signaling. Each of these innovative approaches, 
focusing on rendering a specific kinase inactive or active, has merits 
for interrogating kinase-dependent biological pathways. However, 
these approaches do not currently allow for the simultaneous con- 
trol and study of multiple kinases. 

The approach discussed in this chapter, split protein kinases 
(split kinases), complements the three previously described meth- 
ods, providing ligand-induced gating of the activity of not only 
one, but multiple kinases, which can allow for understanding and 
engineering of more complex signal transduction pathways [24-26]. 
A split kinase consists of a protein kinase dissected into two frag- 
ments, each of which is attached to an interacting protein pair from 
a system commonly called chemical inducers of dimerization (CID) 
(Fig. 1). A CID system is composed of two proteins that do not 
interact with each other until an input, typically a small molecule, 
is added, which causes the proteins to interact or dimerize [27]. 
Thus in our design, each fragment of a split kinase attached to 
ligand responsive proteins is inactive until reassembled through 
the addition of the appropriate chemical input with attendant 
restoration of catalytic activity. Conceptually, the control of mul- 
tiple kinases is possible by using orthogonal sets of CIDs. We have 
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Fig. 1 (a) Split protein kinase consisting of two fragments of a split kinase (NTerm and CTerm), independently 
attached to the interacting proteins of a chemical inducer of dimerization (CID) system (CID1 and CID2), that reas- 
semble upon the addition of a chemical input. Upon ligand —gated reassembly, the split kinase is activated and 
phosphorylates protein substrate(s). (b) The three orthogonal small molecule dependent CID systems that have 
been used to construct split kinase sensors: rapamycin (Rap), abscisic acid (ABA), and gibberellic acid (GA,) 


successfully designed and constructed split kinases using three 
orthogonal CIDs: (1) rapamycin (Rap), which heterodimerizes 
FKBP and FRB; (2) abscisic acid (ABA), which heterodimerizes 
modified versions of interacting proteins, pyrabactin resistance 
1-like protein (PYLcs, amino acids 33 to 209 of PYL1), and type 
2C protein phosphatase, abscisic acid-insensitive 1 protein (ABIcs*, 
amino acids 126 to 423, inactive D143A mutant of ABI) [28, 29]; 
and (3) gibberellic acid (GA3), which heterodimerizes interacting 
proteins gibberellin insensitive dwarfl receptor (GID1) and a 
truncated version of the protein gibberellin insensitive (GAI(92), 
amino acids | to 92) [30, 31]. Rapamycin is a widely utilized CID, 
but interferes with the mTOR pathway, and designed analogs 
(known as rapalogs) can be used in its place in studies where mTOR 
activity is relevant. ABA and GA; are both plant hormone-based 
CIDs and can be potentially used in mammalian cells without 
perturbing native pathways. 

In order to develop split kinases, the two main challenges are 
the identification of appropriate dissection sites and the develop- 
ment of robust methods for identifying functionally reassembled 
proteins. We identified potentially general sites for constructing 
ligand-gated split kinases by identifying sequences in the catalytic 
domain of protein kinases that harbor significant dissimilarities 
[25]. These sites correspond to positions 268, 311, and 393 of 
tyrosine kinase Lyn. With regard to a general and practical method 
to interrogate for split kinase activity, we developed a rapid in vitro 
or in cellulo system for production of the split kinase, with subse- 
quent purification and testing of kinase activity through a direct 
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radioactivity-based assay, which remains a gold standard for activity 
measurements. The in vitro procedure can be completed in 9-10 h, 
which allows for rapid and practical iterative optimization of a par- 
ticular split kinase design. We have successfully used this approach 
to construct split kinases for the catalytic domains of tyrosine 
kinases Src, Lyn, Fyn, Abl, Fak, Hck, the catalytic domain of ser- 
ine/threonine kinase PKA, and full-length tyrosine kinases Src, 
Lyn, and Abl. We envision that other desired split enzymes can be 
similarly designed, interrogated, and optimized. 

The following protocol describes strategies used to construct a 
split protein sensor for specific kinases, and to test the design by 
expressing the fragments either using in vitro or im cellulo 
approaches, and subsequently measuring kinase activity in its on 
and off states (Fig. 2). 
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Fig. 2 General scheme showing the (a) in vitro and (b) in cellulo expression of split protein kinase sensors, 
followed by (c) His6-tag based protein purification and (d) radioactivity-based kinase activity assay. This meth- 
odology allows for the rapid interrogation of activity and optimization of split kinases 


2 Materials 


2.1 Plasmid 
Construction 
and Cloning 


2.2 Split Kinase 
Protein Expression 


2.2.1 In Vitro Expression 
in Rabbit Reticulocyte 
Lysate 


OonN HA UW 


Split-Protein Kinases 311 


. Plasmids encoding for the kinase of interest and the interacting 


proteins from the desired CID system (Addgene is a good 
source for plasmids). We have almost exclusively focused on 
protein tyrosine kinases and the protocols reflect our bias. 


. Plasmid vector for cloning, such as pRSFDuet-1 and 


pcDNA3.1(+). 


. Appropriate oligonucleotide primers to generate inserts. 


. Enzymes for cloning procedures with the necessary reaction 


buffers and dNTPs. Thermostable DNA polymerase (such as 
Taq, e.g., from New England BioLabs or KAPA from KAPA 
Biosystems), Klenow fragment (e.g., New England BioLabs), 
appropriate restriction enzymes, calf intestinal alkaline phos- 
phatase (CIP, e.g., from New England BioLabs), DNA ligase 
(such as T4 DNA ligase, e.g., from New England BioLabs). 


. Competent cells, such as E. cols XL1-Blue, and appropriate 


growth media, such as Luria—Bertani agar and broth. 


. Plasmid DNA and PCR product purification kits. We typically 


use Macherey-Nagel NucleoSpin Plasmid, and NucleoSpin Gel 
and PCR Clean-up kits. 


. DNA polymerase such as Taq or KAPA with provided reaction 


buffer and dNTPs. 


. RNA production system (such as Promega T7 RiboMAX 


Large Scale RNA production system), with provided reaction 
buffer and rNTPs. 


. PCR-product purification kit (e.g., from Macherey-Nagel) and 


G50-microcolumns for mRNA purification (e.g., from GE 
Healthcare). 


. Rabbit reticulocyte lysate (RRL) system (ours was donated 


by Luceome Biotechnologies), which contains the cellular 
components necessary for protein synthesis (tRNAs, amino 
acids, ribosomes, initiation, elongation, and termination factors). 
Commercial systems are also optimized to include an energy- 
generating system consisting of prequalified phosphocreatine and 
phosphocreatine kinase, a mixture of tRNAs to expand the range 
of mRNAs that can be translated, hemin to prevent inhibition of 
initiation, and potassium acetate and magnesium acetate. 


. CID: 6.25 uM Rapamycin (Rap, Sigma-Aldrich). 

. CID: 2.5 mM Abscisic acid (ABA, AG Scientific). 

. CID: 2.5 mM Gibberellic acid (GA;, Sigma-Aldrich). 
. Dimethylsulfoxide (DMSO) 
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2.2.2 In Cellulo 
Expression in Mammalian 
Cells (HEK293T) 


2.3 Split Kinase 
Protein Purification 


2.4 Radioactivity- 
Based Kinase Activity 
Assay 


1. 


Dulbecco’s Modified Eagle’s Medium (DMEM) with 4 mM 
L-glutamine, 4500 mg/L glucose, 1 mM sodium pyruvate, 
and 1,500 mg/L sodium bicarbonate. 


2. Fetal bovine serum (FBS). 


3. 100x penicillin/streptomycin solution (10,000 U/mL peni- 


cillin, 10 mg/mL streptomycin) (P/S). 


4. 100x amphotericin B solution (250 pg/mL) (AmpB). 


. Phosphate-buffered saline (PBS): 5.6 mM Na,HPO,, 1.1 mM 


Na,HPO, and 154 mM NaCl. 


. 0.25% trypsin/0.53 mM EDTA solution. 
. PolyJet in vitro DNA transfection reagent (from SignaGen 


Laboratories). 


. CID: 6.25 pM Rapamycin (Rap, Sigma-Aldrich). 


9. CID: 2.5 mM Abscisic acid (ABA, AG Scientific). 


. CID: 2.5 mM Gibberellic acid (GA3, Sigma-Aldrich). 
. Dimethylsulfoxide (DMSO). 
. Lysis buffer such as Thermo Scientific mPER Mammalian 


Protein Extraction Reagent. 


. 100x Protease inhibitor cocktail from Sigma-Aldrich. 


. 100x Phosphatase inhibitor cocktail from Santa Cruz, cock- 


tails A and B. 


. Bicinchoninic acid protein quantification kit such as Thermo 


Scientific Micro BCA Protein Assay Kit. 


. Ni-NTA agarose resin for His6-tag based purification such as 


Macherey-Nagel. 


. Buffer A: 20 mM Tris, 250 mM NaCl, 20 mM imidazole, 


pH 8. 


. Buffer B: 20 mM Tris, 250 mM NaCl, 20 mM imidazole, 


pH 7. 


. Kinase Assay Buffer: 20 mM MOPS, 1 mM EDTA, 10 mM 


MgCl, pH 7. 


. Kinase Assay Buffer: 20 mM MOPS, 1 mM EDTA, 10 mM 


MegCh, pH 7. 


. Appropriate kinase peptide substrate such as Srctide, Lyntide, 


Kemptide, and PKAtide. These can be bought from suppliers 
such as SignalChem or synthesized. 


7. Adenosine triphosphate (Cold ATP, Sigma-Aldrich). 


. *P-labeled ATP (Hot ATP, PerkinElmer). 


9. P81 phosphocellulose paper (EMD Millipore). 


10 


0.85% Phosphoric acid. 


2.5 Instruments 
and Equipment 


3 Methods 


3.1 Plasmid 
Construction/Cloning 


3.2 Split Kinase 
Protein Expression 


3.2.1 In Vitro Expression 
in Rabbit Reticulocyte 
Lysate 
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Acetone. 


Cocktail for liquid scintillation counting such as Budget Solve 
Complete Count Cocktail. 


. Microcentrifuge. 

. Heat block. 

. Cell incubator. 

. Cell hood. 

. Microscope for cell counting. 


. Scintillation counter. 


. Choose an appropriate site to split the kinase of interest. We have 


utilized a dissimilarity based approach using sequence alignment 
of several members of a particular kinase group. (see Note 1). 


. Choose desired CID system for your split kinase, either under 


the control of rapamycin, abscisic acid, or gibberellic acid, 
which heterodimerize the protein pairs FRB/FKBP, ABI.,’/ 
PYL,, and GAI(92)/GID1, respectively. 


. Using standard cloning techniques (see Note 2), attach the 


N-Terminal portion of the kinase (NTerm) to one of the CID 
interacting proteins (CID1), and the C-Terminal portion of 
the kinase (CTerm) to the other CID interacting protein 
(CID2), to get the final NTerm-Linker-CID1 and CID2- 
Linker-CTerm constructs (see Note 3). Make sure you use the 
appropriate vector for your intended purpose (see Note 4). 


. Design one of your split kinase halves to include a His6-tag for 


protein purification purposes. We commonly incorporate this 
tag at the end of the kinase CTerm (CID2-Linker-CTerm- 
His6) (see Note 5). 


. In order to use the in vitro rabbit reticulocyte lysate system, 


in vitro translation (IVT) PCR linear products are generated 
using the appropriate plasmid as templates, and using a for- 
ward primer that incorporates a RNA polymerase promoter, 
such as T7, and a mammalian Kozak sequence, and a reverse 
primer that incorporates a stabilizing RNA stem loop. 


. With the appropriate IVT PCR linear products in hand, gener- 


ate mRNA products using a commercial RNA production sys- 
tem, such as Promega’s T7 RiboMAX Large Scale RNA 
production system. Transcribe 3 ppg of IVI-PCR products 
using a RNA polymerase for 4 h at 30 °C (see Note 6) in a final 
volume of 25 pL, with the provided reaction buffer and rNTPs. 
mRNA is further purified using G50-microcolumns. 
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3.2.2 In Cellulo 
Expression in Mammalian 
Cells (HEK293T) 


3. Using the generated mRNA products, express proteins using a 


RRL system: Mix 1 pmol of mRNA with 0.25 pL of 1 mM 
amino acid mixture, 17.5 pL of RLL, 1 pL of 25-fold solution 
of the desired CID (either 6.25 uM Rap, 2.5 mM ABA or 
2.5 mM GA; in 1% DMSO), and nuclease-free water to a final 
volume of 25 pL. The final DMSO concentration in the mix 
will be 0.04%. Incubate at 30 °C for 90 min (see Note 7). 


. Once protein expression is complete, proceed to Subheading 


3.3 for protein purification and subsequent testing of kinase 
activity. If desired (optional), the expression of the split kinase 
proteins can be simultaneously performed using an in vitro 
protein expression reaction using **S-methionine to label your 
protein of interest and subsequently running the samples on a 
SDS-PAGE gel and detection by autoradiography. The proce- 
dure for this step is out of the scope of this chapter, and can be 
found elsewhere [32]. 


. Maintain HEK293T cells in DMEM media supplemented with 


10% FBS, 1x P/S solution, and 1x AmpB in a 5% CO) incuba- 
tor, at 37 °C. 


. Plate HEK293T cells at a density of 1 x 10° cells per well in 


6-well-plates, and incubate at 37 °C for 18-24 h prior to 
transfection. 


. Co-transfect cells with 1 pg of total plasmid DNA. Typically, 


for split constructs, 0.5 jg of each plasmid is co-transfected 
and compared to transfections with 0.5 pg of wild-type kinase 
and 0.5 pg of empty vector using a PolyJet transfection reagent 
or equivalent according to manufacturer protocols (see Note 
8). In brief, replenish media in cells plated in 6-well-plates, 1 h 
before transfection; mix DNA plasmids in 50 pL of DMEM 
media, and separately mix 3 pL of PolyJet reagent in another 
50 pL of DMEM media; rapidly add the PolyJet solution to 
the plasmid DNA solution, mix 3-4 times, and incubate at 
room temperature for 15 min. Add the resulting 100 pL of 
transfection mix to each well in the 6-well plates; gently mix 
the media in the 6-well plates and incubate at 37 °C for 10 h. 


. 10 h post-transfection, remove transfection complexes, and 


replenish wells with DMEM /0.1% FBS/1x P/S/AmpB media 
(see Note 9) containing either 80 nM rapamycin/0.1% DMSO 
or 100 pM ABA/0.1% DMSO or 100 pM GA;/0.1% DMSO 
and 0.1% DMSO as a negative control. Incubate cells with 
CID for 24-36 h at 37 °C (see Note 10). 


. 34-46 h post-transfection, wash each well with 500 pL of PBS, 


and lyse cells with 100 pL of lysis buffer (mPER supplemented 
with 1x protease and phosphatase inhibitors, and with the 
appropriate CID in activated samples, typically, 250 nM Rap, 
100 pM ABA or 100 uM GAs. Incubate cells with mPER solu- 


3.3 Split Kinase 
Protein Purification 


3.4 Kinase Assay 
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tion for 20 min at 4 °C. Cell lysate is then cleared at 14,000 rcf 
for 10 min at 4 °C. 


. Quantitate total protein concentration in each sample using 


BCA reagent, according to protocol: Mix 5 pL of each sample 
with 245 pL of 10% mPER in water, and 250 pL of BCA 
reagent. Incubate samples at 60 °C for 30 min and then mix 
200 pL of each sample with 800 pL of water in a plastic spec- 
trometer cuvette. Read absorbance at 562 nm and compare 
with a standard curve of 0—-0.05 pg BSA/UL. 


. Take 150 pg of total protein from each sample and make up 


the volume to 100 pL using mPER solution. Proceed to 
Subheading 3.3 for protein purification and subsequent testing 
of kinase activity. 


. All of the buffers used in this procedure should be supple- 


mented with the appropriate CID (250 nM Rap, 100 pM 
ABA, or 100 pM GAs) in the activated samples. 


. For in vitro translated samples, dilute the 25 pL of RLL after 


translation with 75 pL of Buffer A, for a final volume of 
100 pL. The 27 cellulo translated samples are already at a vol- 
ume of 100 pL. 


. For each sample, add 5 pL of Ni-NTA agarose resin (Macherey- 


Nagel) and 100 wL of Buffer A in a microcentrifuge tube (see 
Note 11), and equilibrate for 30 min at 4 °C. Spin down the 
resin at 1530 rcf for 1 min and remove 75 pL of supernatant 
(see Note 12). 


. Add the 100 uL of cell lysate either from in vitro or im cellulo 


translation and bind to the resin for | h at 4 °C. Gently spin down 
the resin (1530 rcf/1 min) and remove 100 pL of supernatant. 


. Wash the resin for 4 min with 100 pL each time with the fol- 


lowing buffers: 1x Buffer A, 3x Buffer B, 1x Kinase Assay 
Buffer (see Note 13). 


. After the last wash, remove a total of 120 pL of supernatant, 


leaving a total volume of 10 pL in the microcentrifuge tube 
(see Note 14). 


Add 10 pL of an appropriate peptide substrate solution (see 
Note 15) and incubate at RT for 30 min. 


. Add 10 pL of a 300 uM ATP solution (83.3 nCi/pL, equivalent 


to adding !/24 of hot *P-ATP PerkinElmer BLU002250UC) 
and incubate at room temperature for 4 h (see Note 16). 


. Spot the reaction mixture after incubation on P81 paper, and 


then wash the paper for 3 min 3x with 500 mL of 0.85% 
phosphoric acid and 1x with 500 mL of acetone. 


. Immerse the P81 paper in 10 mL of scintillation cocktail in a scin- 


tillation vial and measure counts using a scintillation counter. 
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4 Notes 


5. Compare your activated split kinase samples (+CID), non- 


activated ones (—CID), as well as the negative control, for 
example, RRL or cell lysate only, to confirm that the split 
kinase is small molecule dependent. 


1. We have found three general fragmentation sites for tyrosine 


kinases, equivalent to Lyn kinase 298/299, 311/312, and 
393/394 sites (using the numbering system in UniProt 
P07948) [25]. These are a good starting point to create a 
working split kinase system for protein tyrosine kinases. 


. In brief, a standard cloning procedure consists of generating 


desired inserts by PCR amplification using appropriate oligo- 
nucleotide primers, followed by digestion with appropriate 
restriction enzymes. A selected vector is digested using the 
same restriction enzymes, and the insert and vector are 
subsequently ligated using a DNA ligase. The ligated product 
is then used to transform competent cells, grow colonies, 
purify plasmid DNA in those colonies, and finally confirm the 
intended construct by DNA sequencing. 


. We most commonly use the NTerm-CID1 and CID2-CTerm 


arrangement, but different arrangements and orientations should 
be tried to optimize a particular split kinase system. The most 
practical way to generate these constructs is by creating a vector 
with two multiple cloning sites (MCS) separated by a linker 
(MCS-Linker-MCS). The CID interacting proteins can then be 
cloned to generate general CID vectors, such as MCS-Linker- 
CID1 and CID2-Linker-MCS. We have used different linker 
lengths (13, 17, 21 and 25aa, with GGGS repeats), and several 
should be tried to optimize a particular split kinase system. 


. We most commonly use the pRSFDuet-1 vector from Novagen 


and the pcDNA3.1(+) vector from Invitrogen. Both of them 
can be used to produce protein using in vitro methods, while 
only the pcDNA3.1(+) vector is used to express proteins in 
mammalian cells. 


. We have previously placed the His6-tag at the N-termini of the 


constructs, with equally successful results. Other purification 
tags, such as FLAG-tag, can also be used in place of the 
His6-tag. 


. If mRNA yields are low at 4 h and 30 °C, yields may improve 


with 4 h at 37 °C. 


. A ribonuclease inhibitor, such as RNasin, can be added to the 


RLL reaction to prevent mRNA degradation. If the RLL sys- 
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tem does not express your split kinase system efficiently, a 
wheat germ extract system can also be used. 


. If this transfection conditions do not work for your particular 


split kinase sensor, the so-called hard-transfection conditions 
using 2 pg of total plasmid, and trypsinizing and pelleting cells 
before incubation with transfection mix can be tried. Specific 
procedures for hard transfection can be found for different 
commercial suppliers. 


. FBS can sometimes interfere with small molecule permeability. 


Thus, a 0.1% FBS media instead of standard 10% FBS is recom- 
mended at the step where the CID is added. 


Incubation of cells with CID should be optimized as a func- 
tion of time. Maximum expression levels of functional split 
kinases are usually reached at 24 h after addition of CID. 


. Using smaller microcentrifuge tubes (such as 0.2 or 0.6 mL) 


makes it convenient for removal of supernatant without dis- 
turbing the Ni-NTA agarose resin. 


The 25 pL or supernatant left in the microcentrifuge tube is 
necessary to perform subsequent washes without disturbing 
Ni-NTA agarose resin. The excess supernatant is removed at 
the end of the washes, prior to the kinase activity assay. 


For washes, removing supernatant, adding fresh buffer, doing 
a quick gentle vortex, and then incubating for 4 min before 
spinning down is recommended. 


Ni-NTA agarose resin is hygroscopic and absorb buffer and 
thus at the end of the washes there is usually less than the 
25 pL extra volume added at the beginning. Make sure to 
remove the same amount of supernatant in all samples without 
disturbing the resin, since the results for the kinase activity 
assay are relative to how each sample is handled. 


Different kinases phosphorylate different commercial peptide 
substrates. Find an appropriate substrate for your split kinase 
of interest and validate all protocols with the full-length or 
catalytic domain only and conduct experiments at the reported 
K,, or independently verify K,, of the parent kinase prior to 
studies with the split kinase. 


We thank members of the Ghosh lab, Dr. Reena Zutshi, and 
Luceome Biotechnologies for helpful comments and reagents. 
Indraneel Ghosh is the CSO at Luceome Biotechnologies, Tucson, 
AZ. We thank the NIH (1RO01GM115595-01) and NSF (CHE- 
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Chapter 20 


Directed Evolution Methods to Rewire Signaling Networks 


Raphaél B. Di Roberto, Benjamin M. Scott, and Sergio G. Peisajovich 


Abstract 


The ability to sense and process cues about changing environments is fundamental to life. Cells have 
evolved elaborate signaling pathways in order to respond to both internal and external stimuli appropri- 
ately. These pathways combine protein receptors, signal transducers, and effector genes in highly con- 
nected networks. The numerous interactions found between signaling proteins are essential to maintain 
strict regulation and produce a suitable cellular response. As a result, a signaling protein’s activity in isola- 
tion can differ greatly from its activity in a native context. This is an important consideration when study- 
ing or engineering signaling pathways. Fortunately, the difficulty of studying network interactions is fading 
thanks to advances in library construction and cell sorting. In this chapter, we describe two methods for 
generating libraries of mutant proteins that exhibit altered network interactions: whole-gene point muta- 
genesis and domain shuffling. We then provide a protocol for using fluorescence-activated cell sorting to 
isolate interesting variants in live cells by focusing on the unicellular eukaryotic model organism 
Saccharomyces cerevisiae, using as an example recent work that we have done on its G protein-coupled 
receptor Ste2. 


Key words Signaling pathway, Protein network, Directed evolution, Domain shuffling, Random 
mutagenesis, Error-prone PCR, Fluorescence-activated cell sorting 


1 Introduction 


Cells must constantly respond to changes in their environment to 
fulfill their role or simply to survive. It is the function of signaling 
pathways to detect these changes and to decide on an appropriate 
response. These pathways are composed of a variety of proteins 
and small molecules interacting together and forming highly con- 
nected networks. Signaling proteins include extracellular or intra- 
cellular receptors, small molecules, modifiers (kinases, methylases, 
proteases, etc.) and transcription factors, among others. 

Due to their complex roles in signal transmission and regula- 
tion, signaling networks are especially rich in protein-protein 
interactions. These interactions encompass simple binding, or scaf- 
folding, which ensures that effectors and their targets are in close 
proximity, as well as more complex regulatory interactions that can 
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control protein activity [1]. Importantly, protein interactions are 
highly context-dependent. Each interaction partner must be pres- 
ent in threshold amounts and in the right state or conformation, 
and this may depend on the presence of a third partner at an allo- 
steric site. As such, a signaling protein’s activity in its natural con- 
text can dramatically differ from its activity in vitro or even in a 
heterologous expression system. These possible discrepancies can 
lead us astray when studying signaling networks as the sum of indi- 
vidual parts rather than as a complex system of functional interde- 
pendence. Conversely, changes in network interactions, or 
“rewiring,” can alter network output in myriad ways, from simple 
changes in signal dynamics to more complex changes in input/ 
output relationships [2]. 

Despite their significance, network interactions are often over- 
looked when studying protein function and evolution; further- 
more, they are also under-utilized in protein engineering. 
Functional changes during protein evolution are typically studied 
by either comparing proteins from related organisms, or by intro- 
ducing mutations in a gene and observing the consequences of 
those changes in the immediate vicinity of the function being ana- 
lyzed. For example, when studying the evolution of a signaling 
kinase, one could analyze how changes in the amino acid sequence 
of that kinase affect binding to upstream activators, downstream 
substrates, scaffolds, or other regulators. A conceptually different 
approach is to investigate how those changes affect the function of 
the network, rather than the function of an individual gene. For 
that, it is necessary to look beyond the direct interactions between 
the protein being analyzed and its close network neighbours. A 
particularly powerful approach for studying how changes in an 
individual gene affect network function consists in combining ran- 
dom mutagenesis with selection, a procedure known as directed 
evolution [3]. To take full advantage of this approach, the method 
of mutagenesis, as well as the context of selection, should be care- 
fully considered. As such, we provide here some guidelines for the 
directed evolution of signaling proteins in a way that emphasizes 
how changes in individual components affect overall network func- 
tion. We then follow with detailed protocols for mutant library 
creation and selections, focusing on how different types of muta- 
genesis will allow us to explore very different evolutionary trajec- 
tories. These are written with the yeast mating pathway in mind, an 
example of a highly connected signaling pathway, but they can be 
used for other contexts as well. 

In any evolutionary process, a source of variability is needed, 
and so directed evolution begins with gene mutagenesis. This pro- 
cess has improved considerably since the use of chemical and physi- 
cal mutagens. Today, these methods have been supplanted by a 
wide range of approaches, ranging from error prone PCR [4] , in 
which random mutations are introduced in a specific gene, to more 
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complex strategies based on DNA recombination [5], in which 
large DNA fragments (e.g., whole genes, specific protein domains, 
or other DNA regulatory regions) are swapped among different 
library variants. While these approaches are capable of generating 
billions of unique mutant genes (or gene combinations), this 
potential is much reduced by the necessary vector cloning and cell 
transformation/transfection steps that follow. Both steps have lim- 
ited efficiencies that significantly reduce the size of the mutant 
library that can be generated. Furthermore, the introduction of 
random point mutations throughout an entire gene inevitably 
results in a large pool of inactive or unchanged mutants [6], which 
can be seen as wasted potential. This problem has often been reme- 
diated at least in part by restricting the region of mutagenesis. In 
this way, known structural features are avoided and the region of 
interest is mutated more thoroughly. However, this solution comes 
with bias, as catalytic sites are typically targeted because they are 
often well defined and perceived as key to changing function [7, 
8]. This neglects other possible functional changes, including net- 
work rewiring events which are rarely well characterized. 

To ensure that network interactions are not excluded, it is thus 
recommended that targeting specific gene regions be kept to a 
minimum, especially in the absence of knowledge on the protein’s 
interaction motifs. Instead, it is better to increase the efficiency of 
cloning and transformation. As transformation efficiency is strongly 
cell- and vector-dependent, we will not discuss it further. Instead, 
we will focus on the mutagenesis and cloning steps. In our first 
protocol, we will describe the construction of a library of random 
point mutants through a combination of error-prone DNA poly- 
merase, high-fidelity polymerase and type IIS restriction enzymes. 
The latter enzymes, reviewed in [9], are notable for their ability to 
cut DNA downstream of a recognition sequence. Though they are 
useful for scar-free, one-directional cloning, perhaps their greatest 
advantage comes from the possibility to design custom non- 
palindromic overhangs. These can prevent self-ligation between 
inserts (illustrated in Fig. 1) and can therefore dramatically increase 
the efficiency of proper ligation. 

Alternatively, one can avoid point mutagenesis altogether by 
focusing instead on protein domains. Many proteins consist of 
functional units, called domains, which can fold independently of 
each other and accomplish self-contained functions. This modular- 
ity allows different domains to be joined into new proteins and is 
thought to be an essential source of novelty in natural evolution 
[10]. For instance, in the yeast mating pathway, protein kinases 
and scaffolds involved in signal transduction are composed of one 
or more interaction domains [11]. When these domains are manu- 
ally “shuffled” with each other, new functional proteins are 
generated and these can confer a novel response to the overall sig- 
naling pathway [11-13]. By promoting novel interactions between 


Aarl Recognition Site and Endonuclease Activity Overhangs tailored for 
multi-part domain shuffling cloning 
7 ~ HH GGAG 
recognition overhang 
5... CACCTGC _NNNN:NNNN NNN... 3° Wi ccctT 
3’... GTGGACG NNNN NNNN:NNN ... 5’ 
spacer t —™ GCGA 


cut 
HM TGCG 


Domain Identification Aarl digestion of 
Acceptor Plasmid 


Protein A [Domain Ai] Domain A2 [Domain A3] [Domain Ai] Domain A2 [Domain A3] 
Protein B | Domain B1] B1 | Domain B2 | B2 


a 


Addition of flanking overhangs by PCR 


& "4 


Multi-Insert Aarl-based cloning resulting in 
Domain Shuffling Libraries 


Or? 


Fig. 1 Aarl-based multi-insert cloning and domain shuffling. (a) Aarl recognizes a 7 bp sequence but cuts 4 and 
8 bases away from it. Because Aarl is not sensitive to the specific sequence at its cutting sites, it is possible to 
design non-palindromic overhangs that will reduce the chances of insert (or plasmid) self-ligation, there by 


2 Materials 


2.1. Error-Prone PCR 
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unchanged catalytic units, domain shuffling is a powerful way to 
rewire a protein network and gain understanding of how those 
interactions contribute to signaling. Of course, this approach is 
dependent on the presence of clear domain boundaries, and public 
databases are helpful for identifying these. Here, we provide a pro- 
tocol for the construction of a library of shuffled domains. This 
protocol also takes advantage of the unique properties of type IIS 
restriction enzymes in order to efficiently assemble multiple pro- 
tein domains in a defined order. 

Once a mutant DNA library has been generated, it must be 
expressed and screened in order to analyze the resulting diversity 
of phenotypes. Because this screening step is often conducted 
in vitro or in a nonnative context (via heterologous expression in 
E. coli or yeast), mutations affecting network interactions can be 
missed due to the lack of native protein partners. As such, it is pref- 
erable to conduct the assay in a native or near-native context or in 
an environment where the relevant protein partners are exoge- 
nously supplied. Furthermore, if a specific function is desired, a 
selection step is necessary to isolate promising mutants. Importantly, 
the phenotype selected should reflect pathway output rather than 
an individual protein’s activity. Our third protocol describes how 
to select interesting mating pathway mutants using fluorescence- 
activated cell sorting (FACS) in yeast. Together with efficient 
library cloning and protein domain shuffling, this strategy can 
reveal the contribution of network rewiring in protein and pathway 
function, or unlock its potential for applied protein engineering. 


1. DNA primers for error-prone PCR and amplification PCR 
reactions (see Note 1). 


. Target DNA (on plasmid). 

. 40 mM dNTP mix (10 mM each dNTP). 

. Mutazyme II DNA polymerase (Agilent). 

. 10x Mutazyme II reaction buffer (Agilent). 


ann PF W WN 


. DpnI enzyme. 


ee 
Fig. 1 (continued) significantly increasing the efficiency of the multi-insert cloning step. (b) The identification 
of domain boundaries via protein databases is followed by a PCR amplification step that adds Aarl recognition 
sequence and specific overhangs designed for sequential ligation. The domain library is then ligated into a 
vector for which the overhangs match those of the first and last domains that will form shuffled proteins, 
resulting in the creation of a large domain-shuffled gene library. Note that an individual domain could be 
included at multiple positions in the final multi-domain protein, simply by altering the identity of the flanking 


overhangs 
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2.2 Aarl Mediated 
Domain Shuffling 


2.3 Yeast-Based 
Sorting of Rewired 
Pathway Interactions 


3 Methods 


3.1 Error-Prone PCR 


7. QIAquick PCR Purification Kit (Qiagen). 


. Pfu DNA polymerase. 


9. Acceptor plasmid DNA. 


N QO oF #& w 


. Antarctic phosphatase. 

. T4 DNA ligase. 

. DH5a competent E. coli cells (High Efficiency). 

. LB liquid and agar media containing appropriate antibiotic. 


. QTAprep Spin Miniprep Kit (Qiagen). 


. Acceptor plasmid DNA. 
. Donors’ DNA: either genomic DNA containing targeted 


genes, or plasmids carrying the cloned targeted genes. 


. DNA primers for PCR reactions. 

. Pfu DNA polymerase. 

. QTAquick PCR Purification Kit (Qiagen). 
. AarI restriction enzyme. 


. 50x (0.025 mM) oligonucleotide containing AarI recognition 


sequence. 


. AarI 10x reaction buffer. 

. Antarctic phosphatase. 

. 10x Antarctic phosphatase buffer. 

. T4 DNA ligase. 

. 10x T4 DNA ligase buffer. 

. DH5a competent E. coli cells (high efficiency). 


. Synthetic drop-out medium (liquid and solid). 
. Mutant plasmid library. 


Round-bottom tubes with 35 jm cell strainer caps (BD). 


.10mM K. /actis w-factor pheromone. 
. 10 mg/mL cycloheximide. 
. Zymoprep™ Yeast Plasmid Miniprep II (Zymo Research). 


. Prepare a stock solution of Target DNA in water. The amount 


of Target DNA depends on desired mutation frequency (see 
Table 1). As an example we use the yeast G protein-coupled 
receptor Ste2. 


. Prepare a 50 pL reaction as follows: 


Table 1 
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Summary of mutation frequency as a function of target DNA 


Target Gene (S. cerevisiae) 


Target DNA (ng) 100 250 500 
Number of cycles 20 20 20 
Mutant sequences analysed 9 10 8 
Mean substitution frequency per kb 4.4 Bell DY) 
Mean insertion frequency per kb 0.0 0.0 0.0 
Mean deletion frequency per kb 0.3 0.1 0.2 
Mean mutation frequency per kb (see Note 2) 4.7 3.2 3.1 


*The amount of template indicated is the amount of Target DNA to be amplified, not the total amount of DNA tem- 


plate to add to the reaction 


5 pL of 10x Mutazyme II reaction buffer. 
1 pL of 40 mM dNTP mix. 
0.5 pL of PCR primer mix (250 ng/pL of each PCR primer). 


1 pL of Mutazyme I] DNA polymerase (2.5 U/pL). 
1-15 pL of target DNA (see Table 1). 
Water to a final total volume of 50 pL. 


. Place reaction in thermocycler, using the program depicted in 


Table 2. 


. To digest original plasmid containing target DNA, add 1 pL 


DpnI enzyme to the error-prone PCR mixture. Incubate at 
37 °C for 3h. 


. Purify the error-prone PCR reaction using a PCR purification 


kit. Elute with no more than 30 pL water or the provided elu- 
tion buffer. 


. To amplify the Error-Prone PCR product, prepare two 50 pL 


reactions as follows: 

5 pL of 10x Pfu buffer with MgSOy,. 

1 pL of 10 mM dNTP mix. 

2.5 pL of forward PCR primer (10 pM). 
2.5 wL of reverse PCR primer (10 pM). 

1 pL of Pfu DNA polymerase (2.5 U/pL). 


5 wL of purified error-prone PCR product. 
33 pL Water. 
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Table 2 

PCR cycling protocol for error-prone PCR with mutazyme 
Segment Number of cycles Temperature Duration 
1 1 O15 AG 2 min 
2 20-30, see Table 1 95°C 30s 


Primer Ty—5 °C —-_-: 30 


72°C 1 min (<1 kb 
targets) or 1 min/ 
kb (>1 kb targets) 


3 1 TANG 10 min 
Table 3 
PCR cycling protocol for amplifying error-prone PCRs with Pfu DNA 
polymerase 
Segment Number of cycles Temperature Duration 
1 1 OF 2 min 
2 30 OC 30s 
Primer T,,—5 °C 30s 
TAXES 2 min/kb 
5 1 WANS 10 min 
7. Place reactions in thermocycler, using the program depicted in 


10. 


11. 


12. 


Table 3: 


. Combine amplification PCR reactions and purify using a PCR 


purification kit. 


. Digest >4 pg of purified amplification PCR product, and >4 pg 


of desired acceptor plasmid with appropriate restriction 
endonuclease(s). See Subheading 3.2 step 7 for recommended 
AarlI digestion protocol. 


Dephosphorylate the ends of the linearized acceptor plasmid 
by incubation with Antarctic phosphatase at 37 °C for 1 h to 
reduce self-ligation background. 


Purify digested amplification PCR product and acceptor plas- 
mid with a PCR purification kit. Elute with no more than 
30 pL water or provided elution buffer. 


Ligate purified digested products using T4 DNA ligase. The 
number of ligation reactions to perform depends on the desired 
size of the library (see Table 4). Use 150 ng digested acceptor 
plasmid and a 3 molar excess of digested amplification PCR 
product per 20 pL reaction. Incubate at 16 °C overnight. 


3.2 Aarl Mediated 
Domain Shuffling 
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Table 4 

Setup for ligation reaction 
Ligation reaction volume (5 x 20 pL) 100 pL 
Competent E. coli cells volume (20 x 50 pL) 1000 pL 


Expected number of E. coli colonies (i.e., library size) 70,000 colonies 


13. Transform 50 pL competent E. coli cells with 5 pL ligation 
reaction. Plate on LB agar media containing the appropriate 
antibiotic, and incubate at 37 °C overnight. 


14. Randomly select at least ten colonies, and culture each over- 
night in lquid LB media containing the appropriate 
antibiotic. 


15. Prepare plasmid DNA mini-preparations from overnight cul- 
tures, and sequence the inserted mutant gene to determine the 
mean mutation frequency. 


16. If the desired mutation frequency has been established, trans- 
form the remaining ligation reaction volume into competent 
E. coli cells, with 5 pL ligation reaction per 50 pL cells (see 
Table 4). 


17. The total number of plated E. co/z colonies is the approximate 
diversity of your plasmid library. Add 1.5 mL liquid LB media 
to each plate of transformed E. coli cells, sterilely scrape cells, 
and pipette into the same tube. 


18. Prepare plasmid DNA mini-preparations, using one QJAprep 
Spin Column per plate of E. col cells. Elute half the columns 
each with 35 pL elution buffer. Then pass this elutate over the 
remaining columns. Combine all mini-preparations to create 
the final plasmid library (see Note 3). 


Aarl is a Type IIS restriction enzyme that recognizes a 7 bp 
sequence (CACCTGC), cutting 4 and 8 bp away from the recog- 
nition site, leaving a 4 base overhang, as show in Fig. 1. The fact 
that AarI is agnostic to the sequence of the overhangs means that 
users can “design” those sequences, thus enabling scarless cloning 
or, more importantly in this case, allowing the use of NON- 
PALINDROMIC overhangs. This property is key for multi-insert 
cloning, as ligation of multiple fragments flanked by non- 
palindromic overhangs is orders of magnitude more efficient than 
the ligation of fragments with palindromic ends (as palindromic 
overhangs can self-ligate, decreasing the ligation efficiency dramat- 
ically). The steps required to generate libraries of shuffled domains 
are outlined below. 


1. Identify domain boundaries for the genes of interest from 
databases (e.g., UniProtKB /Swiss- Prot). 
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Table 5 
Summary of 4 bp overhangs 
Overhang name DNA sequence 
A GGAG 
B CCCT 
Cc GCGA 
D TGCG 
2. Design PCR primers to amplify protein domains with appropri- 


ate “overhang” sequences and AarI recognition sequences that 
will result in the multi-part assembly of a domain-shuffled gene. 
In Fig. 1, we show the overhang design strategy for a three- 
domain shuffling library. In this case, we have 4 distinct over- 
hangs, of the following sequences as summarized in Table 5: 


As an example, here we show in detail the design of four prim- 
ers, one pair amplifying a domain flanked by “A” and “B” 
overhangs, and one domain flanked by “B” and “D” over- 
hangs. These two fragments could then be joined as an “AB” 
and “BD” pair. 


Primers for “AB” flanked domain: 


Forward primer: 5'-8 random bases—CACCTGCACAAGG 
AG—18-20 bases annealing to the 3' end of your gene fragment 
of interest-3' (note that for the first fragment, the primer should 
contain an ATG start codon). 


Reverse primer: 5’-8 random bases—CACCTGCGTTCAG 
GG— 18-20 bases annealing to the 3' end of your gene fragment 
of interest-3' (note that for multi-part shuffling it is fundamen- 
tal to ensure that all three parts be in frame). 


Primers for “BD” flanked domain: 


Forward primer: 5’-8 random bases—CACCTGCACAACC 
CT— 18-20 bases annealing to the 5' end of your gene fragment 
of interest-3' (note that as this will be an internal domain, there 
is no need to include a START codon). 


Reverse primer: 5’-8 random bases—CACCTGCGTTCCG 
CA— 18-20 bases annealing to the 5' end of your gene fragment 
of interest-3' (note that, as above, additional bases will be 
needed to ensure that all fragments are in frame and since this 
will be the last domain in the shuffled protein, you will need to 
include a STOP codon as well). 


. Perform the PCR reactions using a high fidelity polymerase, 


such as Pfu, as indicated by the manufacturer. Note that the 
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protocol listed here can be used to clone three consecutive 
domains leading to a single three-domain protein, or to clone 
libraries in which each “AB,” “BC,” or “CD” fragment is actu- 
ally a large collection of different domains sharing the same 
overhangs. In the latter case, the result is not a single three- 
domain protein, but rather a library of multiple different com- 
binations of three domains arranged in the desired orders. This 
order is determined by the presence of consecutive matching 
overhangs (e.g., “AB” will ligate upstream of “BC”). 


4, Clean the PCR reactions using the QIAquick PCR cleanup kit 
as indicated by the manufacturer. 


5. Determine the concentration of each amplified fragment 
library by measuring absorbance at 260 nm using a Nanodrop 
spectrophotometer (or equivalent instrument). This step is 
key, as Aarl digestion is sensitive to DNA concentration. 


6. For Aarl-mediated multi-insert cloning of domain shuffled 
libraries, it is necessary to use an acceptor vector flanked by 
Aarl recognition sequences with the first and last overhangs 
used (“A” and “D” in our case), as show below: 


5’... PROMOTER. ..GGAGCAAGGCAGGTG...(~20 bp 
intermediate sequence)... CACCTGCAACATGGG... 
TERMINATOR...3’ 


Note the flanking overhangs in bold, and the AarI recognition 
sequences (in opposite directions) underlined. 

7. Set up individual AarI digestion reactions for each PCR- 
generated DNA fragment library, as well as for the acceptor 
plasmid DNA. For each reaction, set up the components as 
depicted in Table 6, mix gently and incubate at 37 °C for 3 h: 

8. Inactivate AarI by incubation at 65 °C for 20 min. 


9. Purify digested DNA (both plasmid and PCR fragments) 
directly with QIAquick PCR clean up kit. Note that as long as 
the fragment released from the acceptor vector is small (up to 
20 bp) there is no need for gel purification. 


Table 6 
Setup for Aarl restriction enzyme reaction 


Component Amount 


DNA (either acceptor plasmid OR PCR fragment) 5 pg 


10x AarlI reaction buffer 6 pL 
50x Aarl oligonucleotide (0.025 mM) 1p 
AarlI enzyme 2.5 WL 


Water To complete 60 pL 
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10. 


11. 


12. 


13. 


14. 


Dephosphorylate the ends of the linearized acceptor plasmid 
by incubation with Antarctic phosphatase at 37 °C for 1 h to 
reduce self-ligation background. 


Determine the concentration of each digested DNA fragment 
by measuring absorbance at 260 nm using a Nanodrop spec- 
trophotometer (or equivalent instrument). Once again, this 
step is key, as multi-insert ligations are very sensitive to the 
relative concentration of all fragments being ligated. We have 
found that a molar ratio of 2:1:1:1 (where “2” is the acceptor 
plasmid) is efficient for 3-insert ligations. 


Set up ligation reactions as depicted in Table 7 (for a 20 pL 
reaction) and incubate the reaction for 4 h at 16 °C, followed 
by overnight incubation at 4 °C. 


Transform the ligations in highly competent cells (protocols vary 
depending on the particular bacterial cell strain used). Normally, 
chemically competent cells with efficiencies of ~10* CFU are 
suitable. The target number of colonies depends on the theoreti- 
cal size of the library being generated. For example, while in 
principle a single colony will suffice for a single 3-insert ligation 
(though several more would obviously be preferred), a library in 
which “N1” number of inserts are shuffled at position “AB,” 
“N2” number of inserts are shuffled at position “BC,” and “N3” 
number of inserts are shuffled at position “CD,” will have a theo- 
retical size equal to “N1 x N2 x N3”. In this case, it is good to 
have at least ten times more colonies than the expected library 
size to ensure that all variants are represented. 


Transformants can be screened by colony PCR, using standard 
methods. Clone identities should be determined by DNA 
sequencing. 


Table 7 

Setup for ligation reaction 
Component Amount 
Digested/dephosphorylated acceptor plasmid DNA 150 ng 


Digested insert “AB” (or mix of multiple “AB” inserts in As required (based on desired molar ratio) 


desired ratios) 


Digested insert “BC” (or mix of multiple “BC” inserts in As required (based on desired molar ratio) 


desired ratios) 


Digested insert “CD” (or mix of multiple “CD” inserts in As required (based on desired molar ratio) 


desired ratios) 
T4 DNA ligase 
10x T4 DNA ligase buffer 
Water 


Lpl 
2) (wl, 


To complete 20 pL 


3.3 Yeast-Based 
Sorting of Rewired 
Pathway Interactions 
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The following protocol is designed for the identification of mating 
pathway gene variants that enable a strong response to the peptide 
pheromone a-factor. For instance, the gene STE2, which encodes 
the pheromone receptor, can be mutated to identify changes that 
lead to mating pathway induction in the presence of a pheromone 
from the related species Kluyveromyces lactis [14]. In this system, a 
ste2A yeast strain which expresses green fluorescent protein (GFP) 
from a mating-inducible promoter is transformed with a mutant 
STE2 library and used to select active Ste2 variants. 


1. 


10. 
11. 


Transform yeast cells with the plasmid library of mutant 
DNA. We recommend using the high-efficiency lithium ace- 
tate transformation method [15]. This method can be expected 
to yield 15,000 colonies from an initial 5 mL of log-phase 
yeast culture and 1 pg of a 5-10 kbp DNA plasmid. 


. Pick 100 colonies or more and assay the mating pathway 


response of each. This pre-sorting screen will reveal the ratio of 
active to inactive mutants as well as the diversity of the active 
mutants’ phenotypes (Fig. 2a). 


. Place individual mutant colonies in 2 mL of drop-out medium. 


Also inoculate 2 mL volumes with a negative control, such as 
cells transformed with an empty vector, and a positive control, 
such as cells expressing wild-type STE2. Grow overnight at 
30 °C in a 225 RPM shaker incubator. 


. Transfer 40 pL of the overnight culture to 2 mL of drop-out 


medium. Grow this dilution to an optical density at 600 nm 
(OD¢00) of 0.4 to 0.6 (early log-phase). 


. Add «-factor pheromone to each culture to a final concentration 


of 100 nM. Cultures can also be split into two to measure the 
pathway response in the absence of pheromone. Grow for 2 h. 


. Add cycloheximide to a final concentration of 10 pg/mL to 


each culture to arrest protein expression, including GFP. 


. Sonicate each log-phase culture briefly to break large cell 


aggregates. This typically requires two sonication pulses at the 
lowest setting. 


. Run each culture in a flow cytometer to measure GFP fluores- 


cence. This requires a 488 nm laser and a 525/50 nm filter. 


. To proceed with cell sorting, combine the mutant library into 


a single liquid culture. For this, dispense 5 mL of drop-out 
medium onto each plate of transformed yeast cells and scrape 
off the colonies using a plating stick. Aspirate the mixed colo- 
nies and add them to a tube on ice. 


Vortex the colony mixture on a low setting for 30 s. 


Inoculate 50 mL of drop-out medium with 50 uL of the col- 
ony mixture. Also inoculate 2 mL volumes with the negative 
and positive controls. Grow overnight. 


SSC (A) 
FSC (H) 


FSC (A) FSC (W) SSC (W) 


b Untreated cells Cells + pheromone 


GFP (A) 


RFP (A) RFP (A) 
OC) 
Mutant library 
| 15: 
@ K. lactis pheromone 
2 No treatment 
5 10 
i 
Phenotype = 
screen 2 0.5: 
e 
Yeast transformation 
0.0: 
| Wild-type Random variants 
1.5: 
ey 
°, § 1.0 
— Fi 
IS Ce Phenotype 
yer XX screen Z pd 
© 
a 
0.0: 
First cell sorting Wild-type Sorted variants 
| , 
ey pt 
—s 3 
IS SO Phenotype @ 
ye) <x screen 2 05: 
a 
0.0 
Second cell sorting Wild-type Sorted variants 


Fig. 2 Fluorescence-activated cell sorting of yeast mutant libraries. In this example, we show sorting data for 
yeast carrying a Ste2 receptor library, with mutants conferring a mating response to K. /actis pheromone. (a) 
Gating strategy needed to ensure that most events collected correspond to single cells. The rectangular gates 
capture cells in the diagonal, which mostly correspond to single cells (cell aggregates display higher forward 


12. 


L3s 


14. 
15. 


16. 


17. 


18. 


19. 


< 
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Perform steps 4~7 to induce the mating pathway, but do not 
add cycloheximide. 


Transfer 2 mL of each culture to an empty 5 mL round bottom 
tube with a 35 pm cell-strainer cap. Dispense the liquid culture 
through this cap to further reduce aggregation. 


Run the filtered cultures in a cell sorter: 


Vortex a 5 mL round bottom tube containing 500 pL of sterile 
drop-out medium and place it in the cell sorter’s collection 
chamber. 


Set up the gating strategy outlined in Fig. 2a to target live, single 
cells. Most cell types (including bacteria, yeast, and animal cells) 
will form cell aggregates when suspended in solution. Thus, 
there is a risk that a simplistic gating strategy (e.g., selecting cells 
that display high GFP fluorescence) will be compromised by the 
presence of cell aggregates that may include cells with low fluo- 
rescence levels considered to be a single “event” by the flow 
cytometer. To minimize this risk, it is important to devise a multi- 
gate selection strategy that includes at least two gates based on 
forward and side scattering signals capable of distinguishing 
between single cells and aggregates. In the case of yeast cells, two 
gates are sufficient: one plotting the width of the forward scatter- 
ing signal vs. the height of the forward scattering signal, and a 
second gate plotting the width of the side scattering signal vs. the 
height of the side scattering signal, as shown in Fig. 2a. Similar 
strategies can be devised for other cells types, though it is impor- 
tant to be familiar with the specific aggregation propensities and 
light scattering properties of your cell of choice. 


Use your negative control to establish the cells’ baseline fluores- 
cence and your positive control to visualize the fluorescence of 
active mutants. Draw a gate that includes the latter but does not 
overlap with the former. Note that in separate experiments you 
may want to select cells that activate the pathway in the absence 
of stimulus, or only in the presence of stimulus (Fig. 2b). 


Sort the desired number of mutants and collect them in a 5 mL 
tube containing 500 pL of drop-out medium. Once the sort has 
ended, lightly vortex the collection tube to wash the walls. 

Plate the content of the collection tube onto solid medium 


(100 pL per plate). Approximately 40% of the collected events 
can be expected to be recovered this way. 


Fig. 2 (continued) and side scattering heights). (b) Gating strategies needed to select cells that activate the 
pathway response upon pheromone treatment. Note the exclusion of inactive mutants (/eff). (c) Flowchart of a 
Cell sorting experiment. A plasmid library of STE2 mutants is transformed in yeast. An initial phenotypic screen 
shows that most mutants cannot sense the pheromone of K. /actis as well as wild type. Following a first round 
of cell sorting, almost all mutants selected can sense the pheromone and about half can do so better than wild 
type. A second cell-sorting step yields an even greater proportion of strong K. /actis-responsive mutants 
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20. Runa screen as done in steps 2-8 in order to confirm that the 


21. 


sorted mutants have the desired phenotype (see Fig. 2c for an 
example with our data on S. cerevisiae Ste2 mutants capable of 
responding to a pheromone from a distantly related yeast spe- 
cies). Interesting mutants can be isolated and their plasmid 
extracted using the Zymoprep™ Yeast Plasmid Miniprep II to 
be sequenced. 


Optional: Perform iterative rounds of sorting on the sorted 
library to isolate an ever-greater proportion of desired mutants. 


. DNA primers must contain AarI restriction sites, or alternative 


endonuclease sites appropriate for inserting the product into the 
desired acceptor plasmid, followed by 18-20 bp annealing to the 
Target DNA. This annealing sequence will not be mutated. 


. Listed values are similar to those suggested by the manufac- 


turer, although several conditions can be tested simultaneously 
to achieve the desired mutation frequency. In general, using 
less Template DNA and more cycles will increase the mutation 


. If the plasmid library will be transformed into yeast, a plasmid 


DNA concentration of 1 pg/pL is recommended. The DNA 
concentration can be increased by standard precipitation with 
sodium acetate and ethanol. 
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